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Description 

Field of the Invention 

0 [0001] The present Invention relates to the construction and the method of computation of an Information processing 
system for greatly Improving processing performance by means of parallel processors and, more specifically, to an 
increase of speed In numerical calculations of differential equations. 

Description of the Background Art 

10 

[0002] In the solution of differential equations which Is the main field of application of the present invention, the object 
equation is solved with the calculus of finite differences through Interaction of mutually adjacent lattice points in analytic 
time and space which Is discrete in space and time. In the past, those problems have been analyzed by the method 
of either sequential calculation by Neumann's computer or parallel calculation on multi-processor computers. 
[0003] Moreover, in recent years, large-scale estimation by numerical analysis of heat and fluid is being made In 
weather forecasting or environmental assessment of atmospheric & ocean pollution, for example. I n weather forecast- 
ing, a weather report Is issued by estimating the movement of atmosphere such as movement of high & low air pres- 
sures, course of typhoon, etc. based on the air pressure, wind velocity, atmospheric temperature, solar radiation, etc. 
In environmental assessment, the way the source of pollution Is diffused by air currens or tidal currents is evaluated 

20 through estimation of the current velocity and the concentration of polluting matters. A large number of supercomputers 
are put into operation for such jobs and a great amount of time is spent for computation. In this way, a lot of computer 
resources have been put Into operation and great efforts have so far been made to Improve the computing speed and 
the accuracy of estimation, prevent disasters due to winds and rains and to prevent destruction of environments. Meth- 
ods of dealing wtth this kind of problems on multi-processor machines are being tried and, as shown in the collection 

26 of papers No. 558, vol . 59, pp. 427-432 (literature 1 ) of the Japanese Society of Mechanical Engineers and the collection 
of papers No. 669, vol. 60, pp. 99-106 (literature 2) of the Japanese Society of Mechanical Engineers, use is made of 
the domain decomposition method which consists in dividing a discrete analytic area into blocks and assigning the 
respective split areas to single element processors so as to perform parallel processing by means of a multi-processor 
system which consists of multiple element processors. 

&> [0004] Fig. 66 Is a general construction drawing of the system for performing the domain decomposite process by 
combining element processors by means of a near-neighbor mesh indicated In literature 1 which Is an example of prior 
art. In the drawing, 201 is an element processor acting on a single assigned subdomain, 202 Is a connection between 
element processors for data communication between element processors 201 , 203 is the host computer for integrally 
control Dng the element processors 201 , 204 is an interface board for connecting the host computer 203 and the dement 

35 processors 201 , and 205 is a transputer board on which 4 element processors 201 are mounted. 

[0005] Moreover, in the example of prior art in literature 2, a parallel general purpose computer AP1000 is used and 
the element processors 201 are connected by means of a similar near-neighbor mesh. 

[0006] In the parallel computation method utilizing those multi-processor systems for general use, low-speed com- 
munication such as packet communication, etc. is used for the communication between processors. Moreover, because 
40 synchronization of processing Is necessary between processors, there exists an overtiead at the time of data transfer 
between element processors 201 and this limits the efficiency of parallel computation as the number of element proc- 
essors 201 increases. 

[0007] Fig. 69 Is a drawing showing the efficiency saturation in parallel computation due to the increase in the number 
of element processors in the case of computational fluid dynamics which is an example of solving partial differential 

4* equations as indicated in literature 2 and user the finite difference method. In the drawing, abscissa indicates the 
number of element processors 201 , the ordinate Indicates the Improvement ratio of operation speed , the mark O is the 
operation speed Improvement in a case where the MAC (Marker And Cell) method is used, the mark A is the operation 
speed improvement In the case of we of the SMAC (Simplified MAC) method, the marie D is the operation speed 
Improvement of a case where the HSMAC (Highly Simplified MAC) method is used, and the solid line Indicates the 

so operation speed improvement proportional to the number of element processors 201. In the MAC method, SMAC 
method and HSMAC method, the increase in efficiency of parallel computations slows as the number of element proc- 
essors 201 Increases due to the overhead of the data transfer. 

[0008] Moreover, to look at the possibility of parallel operations in the analysis of partial differential equations, it was 
not sufficient for the conventional numerical analysis using existing hardware to incorporate parallel operations into 
as the analysis of partial differential equations, because of the emphasis put on the universality of the hardware architec- 
ture. 

[0009] In other words, even by using supercomputers or parallel computers for general use, it was difficult to analyze 
large-scale partial differential equations rapidly and yet at low cost 
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[0010] Because the conventional solving system of differentia] equations was a process as described above, the 
main stream In the solution of partial differential equations has been the domain decomposition method using parallel 
processors for general use such as nCUBE, AP-1000, Transputer, eta For the communication between those multi- 
processors, low-speed communication such as packet communication, etc. is used. As far as the existing hardware 

0 architecture is retained, there Is no choice but attach much importance to parallel and/or concurrent processing between 
processors. Namely, while the solution by domain decomposition method was effective to some extent for the parallel 
processing of coarse grain parallel processing in which the operation time represents a larger proportion compared 
with the communication time in the entire time required for computation, it had a problem that the overhead at the time 
of data transfer increases and this causes a drop of efficiency of parallel computation in the fine grain parallel processing 

10 in which the computation time represents a larger proportion compared with the operation time appearing in the analysis 
of differential equations. 

[001 1] Moreover, while various methods are devised for raising the latency tolerance (countermeasure against the 
problem of waiting for data) by improving software, a problem was that it was impossible to radically change the com- 
munication system as far as the existing hardware is used and therefore there was no way to sharply decrease the 

18 waiting time for synchronization of communication. 

[0012] Furthermore, another problem was that the parallel processing on the level of operation in the processor 
depends on the architecture of the hardware and no effective measure can be taken on the software. 
[0013] Further Information relating to the priorart can be found in MICROPROCESSING & MICROPROGRAMMING, 
APRIL 1967, NETHERLANDS, ISSN 0165-6074, pages 113-118, SIVA RAM MURTHY C ET AL "A multi-mi croproc- 

20 essor architecture for solving partial differential equations," which presents an architecture of a fault-tolerant, special- 
purpose multi-microprocessor system for solving partial differential equations. The modular nature of the architecture 
allows the use of hundreds of processing elements (PPs) for high throughput The mlcroporcessor system consists of 
identical PE's connected by a two-dimensional bused-network and two bus control units (BCU's). The communications 
are either "point-to-point 11 or "broadcast." A PE consists of one arithmetic processing unit and a communication proo- 

25 essor. There is no shared global memory. The PE's comprise program and data memory. 

[001 4] The first object of the present invention is to provide an information processing system capable of improving 
latency tolerance by asynchronously transmitting messages between element processors. 
[001 6] Moreover, the second object of the present invention Is to provide an information processing system capable 
of realizing a memory access system suitable to the solution of differential equations by having a memory structure 

30 protected against any disturbance to the processing inside the element processors by communication. 

[001 6] Furthermore, the third object of the present Invention is to provide an Information processing system capable 
of fully demonstrating the parallel processing on the level of operation Inherent in the method of numerical analysis 
based on thermal hydraulic numerical analysis and other methods of solution of partial differential equations, by ar- 
ranging computing elements realized by directly turning the wiring logic into hardware in the processors. 

3* [0017] To achieve these and other objects, the present invention provides an information processing system in ac- 
cordance with independent claim 1 as weD as a corresponding method of computation as defined in Independent claim 
30. Preferred embodiments of the invention are detailed in the dependent claims, 

[0018] The aim and advantages of the various features of the Invention are discussed hereinbeiow with reference 
to various aspects of the invention. 
40 [0019] First and second aspects of the invention are defined by claims 1 and 5, respectively. Twenty seventh and 
twenty eighth aspects of the invention are defined in claims 30 and 34, respectively. 

[0020] According to the third aspect of the present invention, in the information processing system of the second 
aspect, a plural number of two-dimensional planes, in which the element processors performing the processing of the 
lattice points obtained by spatially dividing a two-dimensional field are two-dimensJonally arranged, are arranged along 
40 the time base to be constructed three-dimensionally, when a two-dimensional analytical solution is looked for, and the 
element processors are constructed in pipeline in the direction of time base. 

[0021] According to the fourth aspect of the present invention, the information processing system of the first or second 
aspect further comprises transmission means for transmitting initial conditions and boundary conditions to the element 
processors. 

60 [0022] According to the fifth aspect of the present Invention, in the Information processing system of the first or 
second aspect, data communications between the adjacent element processors are performed individually for each of 
the element processors, asynchronously and repeatedly. 

[0023] According to the sixth aspect of the present invention, in the information processing system of the first or 
second aspect, data communications between the adjacent element processors are message communications and 
& performed with multiple Instruction stream and multiple data stream. 

[0024] According to the seventh aspect of the present invention, in the information processing system of the first or 
second aspect, the element processors can directly refer, respectively, to the data memory of the adjacent element 
processors. 
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[0025] According to the eighth aspect of the present Invention, in the information processing system of the seventh 
aspect, the element processors can directly refer to and renew, respectively, the data memory of the adjacent element 
processors. 

[0026] According to the ninth aspect of the present Invention, in the information processing system of the first or 
second aspect, the data memories include, on the routes connecting the adjacent element processors, first memories 
shared by the adjacent element processors and can directly refer to and renew the content stored in the first memories 
from a plural number of the element processors connected to the first memories. 

[0027] According to the tenth aspect of the present invention, In the information processing system of the seventh 
aspect, the data memories include multiport memories. 

[0028] According to the eleventh aspect of the present invention, the information processing system of the first or 
second aspect further comprises buffer means for temporarily buffering access demand to the data memories. 
[0029] According to the twelfth aspect of the present Invention, in the information processing system of the first or 
second aspect, the element processors respectively have buffer memories to which data sent from the adjacent element 
processors are written as data with object name In communications between the adjacent element processors, and 
the element processors complete reception of data from the adjacent element processors by searching data with object 
name at optional timing from the buffer memories. 

[0030] According to the thirteenth aspect of the present invention, in the information processing system of the first 
or second aspect, the element processors respectively have matching memories for waiting for the data necessary for 
at least one of the computing operations, and the processing unit in each of the element processors performs at least 
one prescribed operation among the computing operations at the point in time sequence when the necessary data are 
available in the matching memory In the same. 

[0031] According to the fourteenth aspect of the present invention, in the Information processing system of the first 
or second aspect, Hie element processors respectively have program memories available for rewriting the program 
stipulating the operations of the processing units. 

[0032] According to the fifteenth aspect of the present invention, in the Information processing system of the first or 
second aspect, each of the processing units works based on a data flow system which enables simultaneous execution 
of a plural number of the program assigned to different areas of each of the program memories, and each of the 
processing units executes the computing operations of a plural number of the lattice points simultaneously by assigning 
a plural number of programs for processing the respective lattice points to different areas of each of the program 
memories. 

[0033] According to the sixteenth aspect of the present Invention, In the Information processing system of the fifteenth 
aspect, the data flow system Includes a dynamic data driven system capable of simultaneously executing one same 
program on input data of a plural number of sets with the use of identifiers. 

[0034] According to the seventeenth aspect of the present invention, in the information processing system of the first 
or second aspect, the processing units respectively have hardware circuits in which arithmetic-logic elements are wired 
in the shape based on a data flow graph according to the wired-iogic for part or whole of the data processing flow 
[0035] According to the eighteenth aspect of the present Invention, In the Information processing system of the first 
or second aspect, the processing units respectively have hardware circuits according to one same wired -logic, the 
hardware circuits load In advance the data memories of the element processors with internal point bits indicating that 
the lattice points are positioned Inside a computing area In the field or outside the object in the computing area with 
respect to all of the lattice points, with the external point btta indicating that the lattice points are positioned outside the 
computing area or inside an object in the computing area and with boundary condition bits indicating information of 
boundary conditions with respect to the lattice points positioned on the boundary of the computing area and the hard- 
ware circuits change the processing of the data flow graph according to the internal point bits, the external point bits 
and the boundary condition bits. 

[0036] According to the nineteenth aspect of the present invention, in the information processing system erf the sev- 
enteenth aspect, the processing units further respectively have arithmetic devices for directly executing operation codes 
for cumulatively adding the respective results of multiplication of first input by second input, third input by fourth input, 
fifth Input by sixth Input, .... (2 x N - 1)-th Input by (2 x N)-th Input to 2N pieces of Input data. 
[0037] According to the twentieth aspect of the present Invention, the information processing system of the first or 
second aspect, further comprises fine node couplings connecting the adjacent element processors to each other and 
coarse node couplings connecting the element processors to each other by skipping no less than one the element 
processors between them. 

[0030] According to the twenty-first aspect of the present invention, in the information processing system of the first 
or second aspect, the element processors respectively have output units for asynchronously reading output data, 
furnishing that output data with identifiers necessary for arranging and sorting of data and sending data to outside the 
element processors. 

[0039] According to the twenty-second aspect of the present Invention, the Information processing system of the 
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twenty-first aspect, further comprises output data image processing unit for collecting output data from the output units 
of the element processors and generating visible Images of the output data in real time based on the output data arriving 
asynchronously from all of the element processors. 

[0040] According to the twenty-third aspect of the present invention, the Information processing system of the first 
or second aspect, farther comprises coordinate conversion urtft for converting an first coordinate system used by a 
host computer controlling the element processors to a second coordinate system used by the element processors 
when data are transferred from the host computer to the element processors, and converting the second coordinate 
system used by the element processors to the first coordinate system used by the host computer when data are trans- 
ferred from the element processors to the host computer. 

[0041] According to the twenty-fourth aspect of the present invention, in the information processing system of the 
first or second aspect, the plural number of element processors Integrally controlled by a host computer respectively 
comprise first convergence Judging units, provided in each of the element processors and independently of the process- 
ing units, forjudging convergence wtthout disturbing the computation of the processing units, and the Information 
processing system further comprising: first communication routes tor dividing a plural numberof the element processors 
into a plural number of groups and connecting the first convergence judging units of the respective element processors 
in the form of a unidirectional closed loop or in a centralized form in the respective groups; second convergence judging 
units, connected to the first communication routes respectively, for integrating the results of judgment of convergence 
of all the first convergence judging units in the groups; and second communication routes for connecting all of the 
second convergence Judging units of the groups in the form of a unidirectional closed loop or In a centralized form. 
[0042] According to the twenty-fifth aspect of the present invention , the Information processing system of the first or 
second aspect, further comprises: at least one route for transferring messages of a plurality of the element processors; 
first convergence Judging means forjudging convergences on the element processors or no less than one processing 
means assigned to each of the element processors; first message sending means for sending a first message indicating 
the convergence on a prescribed element processor or the processing means of the prescribed element processor 
when the first convergence judging means Judges the conve rgence on the prescribed element processor or the process- 
ing means of the same; second message sending means for sending a second message indicating the convergence 
on the element processor or the processing means receiving the first or second message from the adjacent element 
processor or the adjacent element processor after the element processor or the processing means receiving the first 
or second message; and second convergence Judging means for Judging the convergence of all the element processors 
by transferring the second message one after another through the at least one route by the second sending means. 
[0043] According to the twenty-sixth aspect of the present Invention, in the Information processing system of the 
twenty-fifth aspect, the at least one route includes a plurality of different routes connecting the processing means, and 
the second convergence Judging means confirms the convergence of all the element processors In multiplex ways by 
the messages transferred one after another through the plurality of different routes. 

[0044] The present invention is also directed a method of computation performed by using an information processing 
system. 

[0045] According to the twenty-ninth aspect of the present Invention, In the method of the twenty-eighth aspect, a 
plural number of two-dimensional planes, In which the element processors performing the processing of the lattice 
points obtained by spatially dividing a two-dimensional field ere two-dimensionaDy arranged, are arranged along the 
time base to be constructed three-dimenslonally, when a two-dimensional analytical solution Is looked for, and the 
element processors are constructed in pipeline In the direction of time base. 

[0046] According to the thirtieth aspect of the present invention, in the method of the twenty-seventh or twenty-eighth 
aspect, the at least one prescribed equation includes at least one differential equation, and the at least one prescribed 
equation obtains the solution to the at least one differential equation by operating the element processors asynchro- 
nously and independently by finite difference method. 

[0047] According to the thirty-first aspect of the present Invention, In the method of the thirtieth aspect, the at least 
one prescribed differential equation Includes at least one partial differential equation or at least one ordinary differential 
equation. 

[0048] According to the thirty-second aspect of the present invention, in the method of the twenty-seventh or twenty- 
eighth aspect, the element processors take charge of computation of a plural number of lattice points obtained by 
spatially dividing the field respectively, and the number corresponding to the lattice points or the base address of the 
data area corresponding to the lattice points on the data memories are used as input tokens to control the computation 
performed by the element processors. 

[0049] According to the thirty-third aspect of the present invention, in the method of the twenty-seventh or twenty- 
eighth aspect, the Information processing system performs the computation by using transmission means for trans- 
mitting initial conditions and boundary conditions to the element processors. 

[0050] According to the thirty-fourth aspect of the present invention , in the method of the twenty-seventh or twenty- 
eighth aspect, data communications between the adjacent element processors are performed Individually for each of 
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the element processors, asynchronously and repeatedly. 

[0051] According to the trdrty-flflh aspect of the present Invention, in the method of the twenty-seventh or twenty- 
eighth aspect, data communications between the adjacent element processors are message communications and 
performed with multiple Instruction stream and multiple data stream. 
6 [0052] According to the thirty-sixth aspect of the present invention, In the method of the twenty-seventh or twenty- 
eighth aspect, the element processors directly refer, respectively, to the data memories of the adjacent element proc- 
essors. 

[0053] According to the thirty-seventh aspect of the present invention, in the method of the thirty-sixth aspect, the 
element processors directly refer to and renew, respectively, the data memories of the adjacent element processors. 
10 [0054] According to the thirty-eighth aspect of the present Invention, in the method of the twenty-seventh or twenty- 
eighth aspect, the data memories include, on the routes connecting the adjacent element processors, first memories 
shared by the adjacent element processors and directly refer to and renew the content stored in the first memories 
from a plural number of the element processors connected to the first memories. 

[0055] According to the thirty-ninth aspect of the present Invention, in the method of the thirty-sixth aspect, the data 
Jfi memories Include multiport memories. 

[0056] According to the fortieth aspect of the present invention, in the method of the twenty-seventh or twenty-eighth 
aspect, the Information processing system performs the computation by using buffer means for temporarily buffering 
access demand to the data memories. 

[0057] According to the forty-first aspect of the present invention, in the method of the twenty-seventh or twenty- 
mo eighth aspect, the element processors respectively comprise buffer memories to which data sent from the adjacent 
element processors are written as data with object name In communications between the adjacent element processors, 
and the element processors complete reception of data from the adjacent element processors by searching data with 
object name at optional timing from the buffer memories. 

[0058] According to the forty-second aspect of the present invention, In the method of the twenty-seventh or twenty- 
25 eighth aspect, the element processors respectively comprise matching memories for waiting for the data necessary 
for at least one of the computing operations, and the processing unit in each of the element processors performs at 
least one prescribed operation among the computing operations at the point in time when the necessary data are 
available in the matching memory In the same. 

[0059] According to the forty-third aspect of the present Invention, In the method of the twenty-seventh or twenty- 
mo eighth aspect, the element processors respectively comprise program memories available for rewriting the program 
stipulating the computing operations of the processing units. 

[0050] According to the forty-fourth aspect of the present Invention, in the method of the twenty-seventh or twenty- 
eighth aspect, the processing units perform the computing operations by using one same computation program, the 
computation program loads In advance the data memories of the element processors with internal point bits indicating 

35 that the lattice points are positioned inside a computing area in the field or outside the object in the computing area 
with respect to all of the lattice points, with the external point bits indicating that the lattice points are positioned outside 
the computing area or Inside an object In the computing area and with boundary condition bits Indicating information 
of boundary conditions with respect to the lattice points positioned on the boundary of the computing area, and the 
computation program changes the processing on the element processors according to the internal point bits, the ex- 

40 temal point bits and the boundary condition bits. 

[0081] According to the forty-fifth aspect of the present invention, in the method of the twenty-seventh or twenty- 
eighth aspect, each of the processing units works based on a data flow system which enables simultaneous execution 
of a plural number of the program assigned to different areas of each of the program memories, and each of the 
processing units executes the computing operations of a plural number of the lattice points simultaneously by assigning 

45 a plural number of programs for processing the respective lattice points to different areas of the program memories. 
[0062] According to the forty-sixth aspect of the present Invention, in the method of the forty-fifth aspect, the data 
flow system Includes a dynamic data driven system capable of simultaneously executing one same program on input 
data of a plural number of sets with the use of identifiers. 

[0083] According to the forty-seventh aspect of the present Invention, In the method of the forth-slxth aspect, the 
so processing units calculate the address of the data memories which store data regarding the adjacent element proces- 
sors and make reference to and renewal of the data memories by using the address of the data memories, based on 
the value of the Identifiers assigned to the Input data. 

[0084] According to the forty-eighth aspect of the present invention, in the method of the twenty-seventh or twenty- 
eighth aspect, the processing units respectively have hardware circuits in which arithmetic-logic demerits are wired 
55 in the shape based on a data flow graph according to the wired-logic for part or whole of the data processing flow 
[0065] According to the forty-ninth aspect of the present invention, in the method of the twenty-seventh or twenty- 
eighth aspect, the processing units respectively have hardware circuits according to one same wired-logic, the hard- 
ware circuits load In advance the data memories of the element processors with internal point bits Indicating that the 
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lattice points are positioned inside a computing area in the field or outside the object in the computing area with respect 
to all of the lattice points, with the external point bits Indicating that the lattice points are positioned outside the computing 
area or inside an object In the computing area, and with boundary condition bits indicating information of boundary 
conditions with respect to the lattice points positioned on the boundary of the computing area, and the hardware circuits 
change the processing of the data flow graph according to the Internal point bits, the externa) point bits and the boundary 
condition bits. 

[0086] According to the fifties aspect of the present invention, in the method of the fbrth-eighth aspect, the processing 
units respectively have arithmetic devices for directly executing operation codes for cumulatively adding the respective 

results of multiplication of first Input by second input, third input by fourth input, fifth input by sixth Input (2 x N - 

1)-th input by (2 x N)-th input to 2N pieces of input data. 

[0067] According to the fifty-first aspect of the present Invention, in the method of the forty-eighth aspect, the infor- 
mation processing system levels the number of times of communications at all of the lattice points by performing dummy 
communications even for the lattice points with smaD number of times of communications for solving the at least one 
governing equation of the field because the lattice points forming the subject of computation are found on the border 
of areas. 

[0088] According to the fifty-second aspect of the present invention, in the method of the twenty-seventh or twenty- 
eighth aspect, the Information processing system levels the number of times of operations at all of the lattice points by 
performing dummy operations even for the lattice points with small number of times of operations for solving the at 
least one governing equation of the field because the lattice points forming the subject of computation are found on 
the border of areas. 

[0089] According to the fifty-third aspect of the present invention, in the method of the twenty-seventh or twenty- 
eighth aspect, the Information processing system which performs processing of each of the lattice points for finding 
solution to at least one differential equation by finite difference method and performs communications between the 
respective lattice points, Information Indicating the necessity or not of communication and the method of communication 
for each of the lattice points. 

[0070] According to the fifty-fourth aspect of the present invention, in the method of the twenty-seventh or twenty- 
eighth aspect, the information processing system which performs processing of each of the lattice points for finding 
solution to at least one differential equation by finite difference method assigns a memory area to each of the lattice 
points, performs communications between Information of adjacent lattice points by the memory area end provides each 
of the lattice points with Information for indicating the memory area to be used for communication in each direction 
where communication should be made. 

[0071] According to the fifty-fifth aspect of the present invention, in the method of the fifty-fourth aspect, the infor- 
mation processing system provides each of the lattice points with information indicating a common dummy area in the 
direction where communication is Impossible. 

[0072] According to the fifty-sixth aspect of the present invention, in the method of the twenty-seventh or twenty- 
eighth aspect, when the adjacent lattice points are assigned over different the element processors, the element proc- 
essors on both sides hold two pairs of data In time for intercommunication and, when the data for those lattice points 
has been renewed, the processors do not rewrite the data If any displacement of the renewed data is no larger than 
the prescribed value In comparison with the data of the chronologically previous pair. 

[0073] According to the fifty-seventh aspect of the present invention, In the method of the fifty-fourth aspect, the 
information processing system assigns, to the data memory, virtual data area simulating lattice points which are closest 
to the bonder of the computation area forming the subject of the calculation of the at least one prescribed equation and 
found outside the computation area. 

[0074] According to the fifty-eighth aspect of the present Invention, in the method of the twenty-seventh or twenty- 
eighth aspect, the information processing system comprises fine node coupling connecting mutually adjacent the el- 
ement processors to each otherand coarse node coupling connecting the element processors to each other by skipping 
no less than one the element processors between them. 

[0075] According to the fifty-ninth aspect of the present invention, in the method of the fifty-eighth aspect, the infor- 
mation processing system advances computation of the at least one prescribed equation by alternately performing 
computation by the element processors connected by the coarse node coupOng and computation by the element proc- 
essors connected by the fine node coupling. 

[007q According to the sixtieth aspect of the present invention, in the method of the twenty-seventh or twenty-eighth 
aspect, the Information processing system performs calculation of the at least one governing equation of the field using 
only part of the lattice points selected, in parallel with the calculation of the at least one governing equation of the field 
using all of the lattice points. 

[0077] According to the sixty-first aspect of the present invention, In the method of the twenty-seventh or twenty- 
eighth aspect, the algorithm of the calculation of the at least one governing equation of the field using only selected 
ones of the lattice points Is different from the algorithm used for the calculation of the at least one prescribed governing 
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equation of the field using eD of the lattice points. 

[0078] According to the sixty-second aspect of the present invention, in the method of the sixty-first aspect, the 
information processing system has no less than two pairs of data regarding mutually adjacent the lattice points in time 
sequence and stops the operation of the lattice points if the amount of fluctuation between the pairs of data is no larger 
than the prescribed amount 

[0079] According to the sixty-third aspect of the present invention, in the method of the sixty-second aspect, the 
value of the prescribed amount for omitting the operation of the lattice points depending on the value of fluctuation 
between the pairs of data is reduced according to the grade of convergence. 

[0080] According to the sixty-fourth aspect of the present invention, in the method of the twenty-seventh or twenty- 
eighth aspect, the element processors respectively have output units for asynchronously reading output data, furnishing 
that output data with Identifiers necessary for arranging and sorting of data and sending data to outside the element 
processors. 

[0081] According to the sixty-fifth aspect of the present Invention, In the method of the sixty-fourth aspect, the infor- 
mation processing system comprises output data image processing unit for collecting output data from the output units 
of the element processors and generating visible images of the output data in real time based on the output data arriving 
asynchronously from ell the element processors. 

[0082] According to the sixty- sixth aspect of the present invention, in the method of the twenty-seventh or twenty- 
eighth aspect, the information processing system comprises coordinate conversion unit for converting an first coordi- 
nate system used by a host computer controlling the element processors to a second coordinate system used by the 
element processors when data are transferred from the host computer to the element processors, end converting the 
second coordinate system used by the element processors to the first coordinate system used by the host computer 
when data are transferred from the element processors to the host computer. 

[0083] According to the sixty-seventh aspect of the present invention, in the method of the twenty-seventh or twenty- 
eighth aspect, the plural number of element processors are a plural number of element processors integrally controlled 
by a host computer, the plural number of element processors comprise convergence Judging units, provided in each 
of the element processors and independently of the processing units respectively, for judging convergence without 
disturbing the computation of the processing unit, and the Information processing system comprises first communication 
routes for dividing a plural number of the element processors Into a plural number of groups and connecting the con- 
vergence judging units of the respective element processors In the form of a unidirectional closed loop or in a centralized 
form In the respective groups, and second communication routes connected to each of the first communication routes 
of the groups, and connecting all of the convergence judging units integrating the results of judgement of convergence 
of aO the convergence Judging units In the groups and all of the convergence Judging units of the groups in the form of 
a unidirectional closed loop or in a centralized form. 

[0084] According to the sixty-eighth aspect of the present invention, in the method of the twenty-seventh or twenty- 
eighth aspect, the information processing system comprises at least one route for transferring messages of a plurality 
of the element processors. The method of computation further comprises steps of (a)judging convergences on each 
of the element processors or no less than one processing means assigned to each of the element processors; (b) 
sending a first message Indicating a prescribed element processor or a processing means of the prescribed element, 
when the Information processing system judges the convergence on the prescribed element processor or the process- 
ing means of the prescribed element; (c)sending a second message indicating the convergence on the element proc- 
essor or the processing means receiving the first or second message from the adjacent element processor or the 
adjacent element processor after the element processor or the processing means receiving the first or second message; 
and (drudging the convergence of all the element processors by transferring the second message one after another 
by repeating the step (c). 

[0085] According to the sixty-ninth aspect of the present invention, in the method of the sixty-eighth aspect, the 
Information processing system repeats the step (a) to step (d) for a plurality of confirmations of the convergence of all 
the element processors. 

[0086] According to the seventieth aspect of the present invention, in the method of the sixty-ninth aspect, the infor- 
mation processing system comprises second convergence judging means, the at least one route Includes a plurality 
of different routes connecting the processing means, and the second convergence Judging means confirms the con- 
vergence of all the element processors in multiplex ways by the messages transferred one after another through the 
plurality of different routes. 

[0087] According to the seventy-first aspect of the present invention, In the method of the seventieth aspect, the 
Information processing system forms Judgement of convergence again by stopping the processing of all the element 
processors or the processing means, after the completion of general convergence has been Judged by the processes 
which judge the general convergence. 

[0088] According to the seventy-second aspect of the present invention, the method of computation performed by 
using an information processing system having a plural number of element processors further comprises steps of: 
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initializing the storing means for storing the results of judgement of convergence; and Judging convergence or not of 
the computation for the lattice points for the respective lattice points taken charge by the element processors and, If It 
Is judged that convergence has not yet been reached, sets the storing means to the prescribed logical value. 
[0089] According to the seventy-third aspect of the present Invention, in the Information processing system of the 
first or second aspect, the plural number of element processors share the data memory between the adjacent element 
processors, and each of the plural number of element processors comprises input/output control unit commonly con- 
nected to a plural number of data memories shared with the adjacent element processors for controlling input/output 
of data with one of the plural number of data memories by switching the direction of communication according to 
demand for access. 

[0090] According to the seventy^Wrd aspect of the present invention, the information processing system of the first 
or second aspect, further comprises a plural number of registers provided in correspondence to the plural number of 
element processors for temporarily storing the results of judgement of a convergence of an operation executed by 
each of the plural number of element processors; and a wired OR circuit for generating logical product of the output 
of a plural number of the registers. 

[0091] According to the seventy-fifth aspect of the present invention, in the method of the twenty-seventh or twenty- 
eighth aspect, a plural number of the element processors share the data memory between the adjacent element proc- 
essors, each of the plural number of element processors comprises input/output control unit commonly connected to 
a plural number of data memories shared with the adjacent element processors for controlling input/output of data with 
one of the plural number of data memories by switching the direction of communication according to demand for access, 
and each of the pi ura I number of element processors refers to the content of the data memory through the Input/output 
control unit. 

[0092] According to the seventy-sixth aspect of the present invention, In the method of the twenty-seventh or twenty- 
eighth aspect, the information processing system temporarily stores, in the plural number of registers provided in cor- 
respondence to the plural number of element processors, the results of judgement of a convergence of the correspond- 
ing element processors, and executes operations while talcing the logical product of the results of judgement of the 
convergence stored in a plural number of the registers with the wired OR circuit 

[0093] According to the seventy-seventh aspect of the present Invention, in the method of the twenty-seventh or 
twenty-eighth aspect, the number of lattice points per one element processor including unstable lattice points for nu- 
merical analysis in which unstable factors exist is smaller than the number of lattice points per one element processor 
not Including any unstable lattice points for numerical analysis. 

[0094] By making data communications between adjacent element processors by means of the data buses of the 
first aspect or the twenty-seventh aspect and performing computation with that data by operating the element proces- 
sors asynchronously and Independently, it becomes possible to shorten the communication time between the element 
processors. 

[0095] As described above, according to the information processing system of the first aspect or the method of 
computation performed by using an information processing system of the twenty-seventh aspect, computation is per- 
formed, in an Information processing system for performing computation by assigning a plural number of lattice points 
obtained by spatially dividing a Meld to element processors, by having a plural number of data buses for having mutual 
communications of data between adjacent element processors and by using data transmitted by data buses and by 
operating the element processors asynchronously and Independently, and this has an effect of making it possible to 
shorten the communication time between the element processors and thus increase the processing speed of the in- 
formation processing system. 

[0096] By making data communications between adjacent element processors by means of the data buses of the 
second aspect or the twenty-eighth aspect and performing computation with that data by operating the element proc- 
essors asynchronously and independently, it becomes possible to shorten the communication time between the ele- 
ment processors. 

[0097] According to the information processing system of the second aspect or the method of computation performed 
by using an Information processing system of the twenty-eighth aspect, computation is performed, in an information 
processing system for performing computation by providing a plural number of lattice points In a way to spatially divide 
a field and correspond to a plural number of time steps, by further having a plural number of data buses for having 
mutual communications of data between adjacent element processors and by using data transmitted by data buses 
and by operating the element processors asynchronously and Independently, 8nd this has an effect of making it possible 
to shorten the communication time between the element processors and thus increase the processing speed of the 
information processing system. 

[0098] The two-dimensional planes of the third aspect or the twenty-ninth aspect, In which element processors per- 
forming the processing of lattice points obtained by spatially dividing a two-dimensional field are two-dimensionally 
arranged, can shorten the communication time between the element processors and can shorten the computation time 
by performing computation on a plural number of times in parallel and designing the element processors In pipeline 
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along the direction of time base. 

[0099] According to the information processing system of the third aspect or the method of computation performed 
by using an information processing system of the twenty-ninth aspect, two-dimensional planes are arranged along the 
time base to be constructed three-dimenslonaOy and the dimension In the direction of time base is constructed In 
pipeline, and this has an effect of making it possible to shorten the communication time between the element processors 
and thus increase the processing speed of the information processing system. 

[0100] in the case where the prescribed equation in the thirtieth aspect is a differential equation and that the solution 
to the differential equation is obtained by using finite difference method, it is possible to obtain analytical solution of 
high accuracy because the influence of the lattice points adjacent to the lattice points forming the subject of the com- 
putation is very small. 

[0101] According to the method of computation performed by using an Information processing system of the thirtieth 
aspect, solution to at least one differential equation is obtained by operating element processors asynchronously and 
Independently by difference calculus, and this has an effect of making It possible to shorten the processing time of the 
Information processing system without sacrificing the accuracy of analysis. 

[0 1 02] In the partial differential equation or ordinary differential equation of the thirty-first aspect, analytical solution 
of high accuracy can be obtained because the influence of the lattice points adjacent to the lattice points forming the 
subject of the computation is negligible when the solution to the differential equation is obtained by using finite difference 
method. 

[01 03] According to the method of computation performed by using an information processing system of the thirty- 
first aspect, solution to at least one partial differential equation or at least one ordinary differential equation is obtained 
by operating element processors asynchronously and independently by difference caiculus, and this has an effect of 
making It possible to shorten the processing time of the Information processing system without sacrificing the accuracy 
of analysis. 

[0 1 04] The element processors In the thirty-second aspect take charge of computation of a plural number of lattice 
points obtained by spatially dividing the field respectively. For that reason, even mutual adjacent lattice points include 
some which extend over two element processors and some which do not By controlling the operation of the element 
processors by using either the number corresponding to the lattice points or the base address of the data area corre- 
sponding to the lattice points as Input tokens to transfer data between adjacent lattice points existing in different element 
processors, it becomes to perform data transfer between the element processors by directly having access to the data 
memory on the adjacent element processors, eliminate data transfer overhead between lattices and perform high- 
speed analysis by asynchronous control. 

[0106] According to the method of computation performed by using an information processing system of the thirty- 
second aspect, the operation of the element processors is controlled by using, as input tokens, either the number 
corresponding to the lattice points or the base address of the data area corresponding to the lattice points on the data 
memory, and this has an effect of making it possible to directly have access to the data memory on the adjacent element 
processor and perform high-speed analysis by asynchronous control by eliminating data transfer overhead between 
lattices. 

[0 1 06] The transmission means according to the fourth aspect or the thirty-third aspect, If initial conditions and bound- 
ary conditions of computation to element processors are input for the host computer, resets the data memory on all 
element processors, assigns the Initial conditions and boundary conditions of computation to element processors to 
the respective element processors through the network and the connection between element processors. And the 
respective element processors get into the standby state for starting computation. 

[01 07] The information processing system of the fourth aspect or the method of computation performed by using an 
information processing system of the thirty-third aspect, which have transmission means for transmitting initial condi- 
tions and boundary conditions to the processing unit, produce an effect of making it possible to realize computation of 
various initial conditions and boundary conditions with a simple procedure of Inputting conditions in the host computer, 
by using one same construction of element processors. 

[0108] By performing data communications between data processing units of the adjacent element processors in 
the filth aspect or the thirty-fourth aspect individually for each processing unit, asynchronously and repeatedly, it be- 
comes possible to have access to the data memory and continue the processing on the respective element processors 
utterly asynchronously without disturbing the processing in the adjacent element processors. Namely, communication 
can be established between adjacent element processors without disturbing the computation In the adjacent element 
processors. 

[0109] According to the Information processing system of the fifth aspect or the method of computation performed 
by using an Information processing system of the thirty-fourth aspect, data communications between adjacent element 
processors are performed Individually for each processing unit, asynchronously and repeatedly, making it possible to 
continue the processing quite asynchronously by having access to the data memory without disturbing the processing 
of the adjacent element processor and make communication without disturbing the operation of the adjacent element 
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processor, end this has an effect of increasing the speed of operation. 

[0110] By performing the communication between adjacent element processors in the sixth aspect or the thirty-fifth 
aspect by message communication and with multiple instruction stream and multiple data stream, it becomes possible 
to Improve the efficiency of parallel processing. 

[01 1 1] According to the Information processing system of the sixth aspect or the method of computation performed 
by using an Information processing system of the thirty-fifth aspect, communications between adjacent element proc- 
essors are message communications and performed with multiplex command flow and multiplex data flow, end this 
has an effect of increasing the processing speed of the information processing system by improving the efficiency of 
parallel processing. 

[0112] In the element processors of the seventh aspect or the thirty-sixth aspect, the access to the data memory 
from adjacent element processors can be limited to the reference action only by directly referring, respectively, to the 
data memory of the adjacent element processors, and this can be done easily by each processing unit asynchronously 
and repeatedly, in the data communication between processing units of adjacent element processors. 
[0113] According to the information processing system of the seventh aspect or the method of computation performed 
by using an information processing system of the thirty-sixth aspect, the element processors directly refer, respectively, 
to the data memory of adjacent element processors and data communications between adjacent element processors 
can be performed easily without collision of requests for renewal to one same memory address for each processing 
unit, asynchronously and repeatedly, and this has an effect of increasing the speed of processing of the information 
processing system by simplifying the arbitration of data buses for memory access. 

[0114] The element processors of the eighth aspect or the thirty-seventh aspect can directly refer to and renew the 
data memory of adjacent element processors, and this makes it possible to sharply improve the efficiency of compu- 
tation and complete the desired operations In a short time by allowing different element processors to directly have 
access to the data memory of others mutually, regarding the data of the adjacent lattices points the processing of which 
is to be taken charge by the respective element processors. 

[01 1 5] According to the information processing system of the eighth aspect or the method of computation performed 
by using an information processing system of the thirty-seventh aspect, the element processors directly refer to and 
renew, respectively, the data memory of the adjacent element processors, and this has an effect of reducing overhead 
for data communications between adjacent element processors and improving the computation efficiency. 
[0116] A plural number of element processors connected to the memory of the ninth aspect or the thirty-eighth aspect 
can refer to and renew the content of memory stored in the memory, and this memory is shared by the adjacent element 
processors on the route connecting adjacent element processors. This makes it possible for improve the efficiency of 
computation by directly referring to and renewing the data of adjacent lattice points in the memory through I/O control 
unit, etc., for example. 

[01 17] According to the information processing system of the ninth aspect or the method of computation performed 
by using an information processing system of the thirty-eighth aspect, a memory shared by adjacent element processors 
is provided on the route connecting the adjacent element processors and the content of memory stored in the memory 
can be directly referred to and renewed from a plural number of element processors connected to the memory, and 
this has an effect of Increasing the speed of processing of the Information processing system by sharply improving the 
computation efficiency and shortening the computation time. 

[01 18] Since the data memory in the tenth aspect or the thirty-ninth aspect is a multiport memory, it becomes possible 
to simultaneously process memory reference commands given to the multiport memory by the element processor 
concerned and the adjacent element processors. 

[01 1 9] According to the information processing system of the tenth aspect or the method of computation performed 
by using an information processing system of the thirty-ninth aspect, the data memory is a multiport memory, and this 
has an effect of easily realizing high-speed operation of the information processing system by eliminating stagnation 
of processing, sharply contributing to the shortening of computation time In processings involving collision of references 
to one same memory address and enabling simultaneous processing of requests for reference to memory made to the 
muitlport memory by the element processor concerned and the adjacent element processor. 
[0120] Since buffer means In the eleventh aspect or the fortieth aspect temporarily buffers access commands to the 
data memories, there is no risk of interruption of the processing even in case of collision of renewal commands to one 
same memory address. 

[0121] According to the Information processing system of the eleventh aspect or the method of computation per- 
formed by using an Information processing system of the fortieth aspect, the system comprises a buffer mechanism 
for temporarily buffering access demand to the data memory, and this has an effect of increasing the speed of process- 
ing of the information processing system by avoiding any interruption of processing due to collision of requests for 
renewal and thus avoiding any increase of computation time. 

[0122] The buffer memories in the twelfth aspect or the forty-first aspect write the data sent from adjacent element 
processors, respectively, In communications between adjacent element processors, as data with object name, and the 
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element processors complete reception of data from adjacent element processors by searching data with object name 
at optional timing from the buffer memories. This makes It possible to prevent any disturbance to the operation Inside 
the respective element processors by the data communications between adjacent element processors. 
[0123] According to the Information processing system of the twelfth aspect or the method of computation performed 
by using an Information processing system of the forty-first aspect, the element processors are farther provided, re- 
spectively, with a buffer memory in which to write, in communications between adjacent element processors, data sent 
from the adjacent element processors as data with object name, and the element processors complete reception of 
data from the adjacent element processors by searching data with object name at optional timing from the buffer 
memory, and this has an effect of obtaining higher operating speed by preventing any interference to the operation 
inside the respective element processors due to data communications between adjacent element processors. 
[0124] By keeping the matching memories In the thirteenth aspect or the forty-second aspect waiting for data nec- 
essary for at least one of the computing operations, each of the processing units in the element processors performs 
a certain amount of prescribed operations only at the point in time when the necessary data Is available In the corre- 
sponding matching memory, transfers the results to the matching memory, sends out data to the adjacent element 
processor and also receives data from the adjacent element processor, thus enabling automatic synchronization be- 
tween data and faster processing with less overhead of communication. 

[0125] According to the information processing system of the thirteenth aspect or the method of computation per- 
formed by using an information processing system of the forty-second aspect, the processing unit performs a certain 
amount of prescribed operations only at the point In time when the necessary data is available in the matching memory, 
transfers the results to the matching memory, sends out data to the adjacent element processor and also receives data 
from the adjacent element processor, and this has an effect of enabling automatic processing with less overhead of 
communication and Increasing the speed of processing of the information processing system. 
[01 20] The program memories in the fourteenth aspect or the forty-third aspect are available for rewriting the program 
stipulating the operations of the processing units and, in the case where the element processors are integrally controlled 
by host computer for example, make it possible to change the program by loading a new program in the program 
memories Inside the processing units from the host computer through a network. 

[0127] According to the information processing system of the fourteenth aspect or the method of computation per- 
formed by using an Information processing system of the forty-third aspect, the system is constructed by comprising 
a program memory available for rewriting the program stipulating the operations of the processing unit and, In the case 
where the host computer Integrally controiDng the element processors is connected through a network, for example, 
It becomes possible to change the program by loading a new program in the program memory inside the processing 
unit from the host computer, and this has an effect of providing an information processing system highly applicable to 
general purposes. 

[01 28] The processing units in the forty-fourth aspect load In advance the data memories of the element processors 
with the Internal point bits, the external point bits and the boundary condition bits so that they may perform operations 
by using one same computation program regardless of the kind of lattice points, and change the processing to the 
element processors by using the Internal point bits, the external point bits and the boundary condition bits. 
[0128] According to the method of computation performed by using an information processing system of the forty- 
fourth aspect, the processing unit performs operations by using one same computation program capable of changing 
the processing In the element processors with the internal point bits, the external point bits and the boundary condition 
bits, and this has the effect of making it possible to perform operations by using one same computation program 
regardless of the kind of lattice points and reduce the scale of the information processing system by simplifying the 
computation program. 

[01 30] Each of the processing units in the fifteenth aspect or the forty-fifth aspect works based on a data flow system 
which enables simultaneous execution of a plural num ber of programs assigned to different areas of each of the program 
memories. And, one same processing unit can execute the processing of a plural number of lattice points simultaneously 
by assigning a plural number of programs for processing the respective lattice points to different areas of the corre- 
sponding program memory. 

[0131] According to the Information processing system of the fifteenth aspect or the method of computation performed 
by using an information processing system of the forty-fifth aspect, the processing unit works based on a data flow 
system which enables simultaneous execution of a plural number of programs assigned to different areas of the program 
memory, and It executes the processing of a plural number of lattice points simultaneously by assigning a plural number 
of programs for processing respective lattice points to different areas of the program memory, and this has the effect 
of making it possible to reduce the scale of the information processing system and simplify the construction of the 
information processing system. 

[0132] The processing units in the sixteenth aspect or the forty-sixth aspect work based on a dynamic data driven 
system capable of simultaneously executing one same program on the input data of a plural number of sets with the 
use of Identifiers. 
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[0139] According to the Information processing system of the sixteenth aspect or the method of computation per- 
formed by using an Information processing system of the forty-sixth aspect, the data flow system Is a dynamic data 
drive system capable of simultaneously executing one same program on said input data of a plural number of sets with 
the use of Identifiers, and this has the effect of easily realizing simplification of construction of the information processing 
system. 

[0134] The processing units in the forty-seventh aspect calculate the address of the data memories which store data 
regarding adjacent element processors based on the value of the identifiers assigned to the input data. And they refer 
to and renew the data memory by using the address of the data memories. For that reason, computation can be made 
easily by dynamic data driven system even in an information processing system which performs, individually for the 
respective processing units, data communication between the processing units of adjacent element processors asyn- 
chronously and repeatedly. 

[0135] According to the method of computation performed by using an information processing system of the forty- 
seventh aspect, the system calculates the address of the data memory which stores data regarding adjacent element 
processors and refers to and renews the data memory by using the address of the data memory, based on the value 
of identifiers assigned to the Input data, and this has the effect of easily realizing simplification of construction of the 
Information processing system. 

[0136] Because the processing units In the seventeenth aspect or the forty-eighth aspect are provided with hardware 
circuits in which computing elements are wired in the shape of a data flow graph according to the wired-logic for part 
or whole of the data processing Dow, the respective data passes through the sequence of arithmetic elements one by 
one from the top of the data graph to be submitted to the processing one after another to automatically reach the final 
arithmetic element. When a series of operations are over, the results of computation are written in the data memory 
through data bus Inside the processor and are also written at the same time in the data memory of the adjacent element 
processors through the connection between element processors. 

[0137] According to the information processing system of the seventeenth aspect or the method of computation 
performed by using an Information processing system of the forty-eighth aspect, the data processing unit is provided 
with a hardware circuit In which computing elements are wired in the shape of a data flow graph according to the wiring 
logic for part or whole of the data processing flow, and the respective data pass through the column of computing 
elements from the top of the data flow graph to be submitted to processing one after another and automatically reach 
the final computing element When a series of operations are over, the data processing unit writes the results of com- 
putation in the data memory through the data bus inside the processors and writes those results of computation, at 
the same time, in the data memory of the adjacent element processors to complete the series of processing operation. 
Therefore, this invention has the effect of realizing high-speed operation by extracting the maximum parallel processing 
specialized for differential operations. 

[01 38] The processing units in the eighteenth aspect or the forty-ninth aspect load in advance the data memories of 
the element processors with the internal point bits, the external point bits and the boundary condition bits so that the 
computation may be made with hardware circuits according to one same wiring logic regardless of the kind of lattice 
points, and change the processing of data flow graph according to the Internal point bits, the external point bits and 
the boundary condition bits. 

[0139] According to the information processing system of the eighteenth aspect or the method of computation per- 
formed by using an information processing system of the forty-ninth aspect, the processing unit Is provided with a 
hardware circuit according to one same wiring logic capable of changing the processing of the data flow graph according 
to the internal point bits, the external point bits and the boundary condition bits, and this has the effect of making it 
possible to perform computations with a hardware circuit according to one same wiring logic regardless of the kind of 
lattice points and reduce the scale of the Information processing system by simplifying the computation program. 
[0140] The processing units in the nineteenth aspect or the fiftieth aspect can increase, when executing operations 
for cumulatively adding the respective results of multiplication of first Input by second input, third Input by fourth input, 
fifth input by sixth Input, (2 X N - 1)-th input by (2 x N)-th input to 2N pieces of input data, the speed of operation 
processing which frequently uses those operations such as Navler-Stokes equation, for example, by directly calling 
the arithmetic device. 

[0141] According to the information processing system of the nineteenth aspect or the method of computation per- 
formed by using an information processing system of the fiftieth aspect, the processing unit is provided with computing 
device for directly executing operation codes for cumulatively adding the respective results of multiplication of first Input 
by second input, third Input by fourth Input, fifth input by sixth input, seventh input by eighth input to 8 pieces of Input 
data, and executes operations of data flow graph according to the wiring logic to data input at the respective input 
terminals of the multiplier and outputs the results at the output terminals of the multiplier, thus presenting an effect of 
increasing the speed of solving equations such as differential equations, etc. 

[0142] The element processors in the fifty-first aspect level the number of times of communications at aD of the lattice 
points by Increasing the number of times of communications of the lattice points with small number of times of com- 
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munlcations for solving the governing equation of a field by having dummy communications even for lattice points with 
small number of times of communications. 

[0143] According to the method of computation performed by using an information processing system of the fifty- 
flret aspect, the number of times of communications at all the lattice points is levelled, and this produces an effect of 
* raising the efficiency in parallel operations. 

[0144] The element processors In the fifty-second aspect level the number of times of operations at aD of the lattice 
points by increasing the number of times of operations of the lattice points with small number of times of operations 
for solving the governing equation of a field by having dummy operations even for lattice points with small amount of 
operations. 

10 [0145] According to the method of computation performed by using an information processing system of the fifty- 
second aspect, the processing unit levels the number of times of communications at all the lattice points by performing 
dummy operations even for lattice points with small number of times of operations, and this produces an effect of 
raising the efficiency In parallel operations. 

[0140] The element processors In the fifty-third aspect Improve the efficiency of communications between lattice 
1 * points by assigning the communication information between lattice points to the respective lattice points. For example, 
the element processors Improve the efficiency of communication between lattice points by assigning the information 
of necessity or not of communication between lattice points and the information about the method of communication 
indicating if It is a communication passing through a network or not, thus eliminating any unnecessary communications. 
[0147] According to the method of computation performed by using an information processing system of the fifty- 
20 third aspect, communication Information between lattice points is assigned to the respective lattice points, and this has 
the effect of improving the efficiency of communication between lattice points and increasing the speed of processing 
of tfio information processing system. Moreover, this invention also has the effect of Improving the efficiency of com- 
munication between lattice points and increasing the speed of processing of the information processing system if 
information of necessity or not of communication indicating the necessity or not of communication between lattice 

29 points and information of method of communication indicating if the communication is made through a network or not 
are assigned to the respective lattice points, for example. 

[0146] The data memories in the fifty-fourth aspect has their memory areas assigned to the respective lattice points, 
and the element processors exchange data between adjacent lattice points through the memory areas and can perform 
the exchange of data between lattice points independently for each lattice point And, the data memories can effectively 

30 hold communications of data for each lattice point by retaining the information for indicating the memory area to be 
used for communication for each direction to be used for communication of each lattice point. 

[0149] According to the method of computation performed by using an information processing system of the fifty- 
fourth aspect, the processing system assigns a memory area of the data memory to each lattice points to make ex- 
change of data between adjacent lattice points through the memory area and provides each of lattice point with infor- 

35 matton for indicating the memory area to be used for communication In each direction where communication should 
be made, and this has an effect of easily realizing high-speed processing by the information processing system. 
[0150] The element processors In the fifty-fifth aspect can perform one same processing regardless if communication 
Is possible or impossible by providing for retention each of the lattice points with Information Indicating a common 
dummy area In the direction where communication is impossible, and does not require any judging program for judging 

*o possibility or not of communication. 

[0101] According to the method of computation performed by using an information processing system of the fifty- 
fifth aspect, each lattice point is provided, for retention, with information indicating a common dummy area in the di- 
rection where communication is Impossible, and this has an effect of making ft possible to reduce the program for 
judging possibility or not of communication and contract the program size. This system is capable of omitting synch ro- 

45 nization, etc. and, therefore, it also has an effect of realizing contraction of program size and increase of processing 
speed 

[01 52] The element processors in the fifty-sixth aspect can omit any unnecessary communications between element 
processors by holding two pairs of data In time sequence for intercommunication when adjacent lattice points are 
assigned over different element processors and by not rewriting the data if any displacement of the renewed data Is 
&> no larger than the prescribed value in comparison with the data of the chronologically previous pair when the data for 
those lattice points has been renewed. 

[0159] According to the method of computation performed by using an Information processing system of the fifty- 
sixth aspect, element processors on both sides mutually hold 2 pairs of data transmitted from the other party In time 
sequence and, when the data for those lattice points has been renewed, the processors do not rewrite the data of the 
55 other party if any displacement of the renewed data Is no larger than the prescribed value in comparison with the data 
of the chronologically previous pair, and this has the effect of increasing the processing speed of the information 
processing system by omitting any unnecessary communication between the element processors. 
[0154] The element processors in the fifty-seventh aspect assign virtual data area simulating lattice points which are 
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closest to the border of the computation area forming the subject of the calculation of equation and found outside the 
computation area and equalize the processing of the lattice points which are immediately Inside the computation are 
with the processing of other lattice points in the computation area. 

[0155] According to the method of computation performed by using an information processing system of the fifty- 
seventh aspect, virtual data area simulating lattice points which are closest to the border of the computation area 
forming the subject of the calculation of said equation and found outside said computation area is assigned to the data 
memory and the data of the lattice point is written in the virtual data area, and this has the effect of enabling simplification 
of processing program or contraction of program size. 

[0156] In the twentieth aspect or the fifty-eighth aspect, by comprising fine node coupling connecting the adjacent 
element processors to each other and coarse node coupling connecting the element processors to each other by 
skipping no less than one element processors between them, it is possible to advance computation with the use of 
coarse node ooupling and fine node coupling and shorten the computing time without lowering the accuracy. 
[0157] According to the information processing system of the twentieth aspect or the method of computation per- 
formed by using an Information processing system of the fifty-eighth aspect, the information processing system ad- 
vances the computation of equations by performing computation by element processors connected by coarse node 
coupling and computation by element processors connected by fine node coupling, end this has the effect of not only 
increasing the convergence speed with computation made by using element processors connected by coarse node 
coupling but also maintain the accuracy of computation by computation made by using element processors connected 
by fine node coupling, thus enabling high-accuracy processing at a high speed. 

[0158] In the fifty-ninth aspect, the computation of equations Is advanced with alternate execution of computation by 
element processors connected by coarse node coupling and computation by element processors connected by fine 
node coupling, the convergence time is shortened with the computation by element processors connected by coarse 
node coupling and the accuracy of computation is compensated with computation by element processors connected 
by fine node coupling. 

[0159] The method of computation performed by using an information processing system of the fifty-ninth aspect 
has the effect of enabling high-accuracy processing at a high speed because this system advances computation of 
equations by alternately performing computation by element processors connected by coarse node coupling and com- 
putation by element processors connected by fine node coupling. 

[0160] In the sixtieth aspect of the present Invention, the computation of equations using selected lattice points only 
simultaneously with the computation of equations of aD the lattice points to improve the converging performance. 
[0161] According to the method of computation performed by using an Information processing system of the sixtieth 
aspect, the system performs calculation of equations using only part of lattice points selected and the calculation of 
governing equations using all the lattice point simultaneously, and this has the effect of shortening the processing time 
of the Information processing system. 

[0162] The algorithm of the calculation of the equations using selected lattice points only in the sixty-first aspect Is 
made to be different from the algorithm used for the calculation of equations using all the lattice points, in such a way 
that emphasis is put on convergence In the former algorithm while emphasis Is put on accuracy of solution In the latter 
algorithm. 

[0163] According to the method of computation performed by using an information processing system of the sixty- 
first aspect, the algorithm of the calculation of equations using only selected lattice points Is different from the algorithm 
used for the calculation of equations using all the lattice points, and this has the effect of making It possible to put 
stronger emphasis on the convergence speed than the accuracy of the algorithm applicable to coarse nodes and to 
sharply reduce the processing time of the Information processing system. 

[0164] The element processors In the sixty-second aspect have no less than 2 pairs of data regarding the adjacent 
lattice points In time sequence and omit the operation of the lattice point concerned if the amount of fluctuation between 
the pairs of data Is no larger than the prescribed amount by comparing those 2 pairs of data. 
[0165] According to the method of computation performed by using an information processing system of the sixty- 
second aspect, the processing unit has no less than 2 pairs of data regarding adjacent lattice points in time sequence 
and omits the operation of lattice points If the amount of fluctuation between the pairs of data Is no larger than the 
prescribed amount, and this has the effect of sharply reducing the processing time of the information processing system. 
[0166] The element processors in the sixty-third aspect can further avoid making any wasteful local computation by 
reducing the value of the prescribed amount for omitting the operation of lattice points depending on the value of 
fluctuation between the pairs of data according to the grade of convergence. 

[0167] According to the method of computation performed by using an information processing system of the sixty- 
third aspect, the value of the prescribed amount for omitting the operation of lattice points depending on the value of 
fluctuation between said pairs of data is reduced according to the grade of convergence, and this has the effect of 
sharply reducing the processing time of the Information processing system. 

[0168] The I/O unit In the twenty-first aspect or the sixty-fourth aspect asynchronously reads output data in the ele- 
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ment processors and, by furnishing that output data with Identifiers necessary for arranging and sorting of data and 
can perform post-processing, even If a plural number of physical values are sent in mixed state, by identifying and 
sorting the respective output data with the identifiers. 

[01 69] According to the information processing system of the twenty-first aspect or the method of computation per- 
formed by using an information processing system of the sixty-fourth aspect, the output I/O unit asynchronously reads 
output data in the element processors and furnishes that output data with identifiers necessary for arranging and sorting 
of date, and this has the effect of performing post-processing by using output data provided with identifiers end per- 
forming high-speed processing while outputting the intermediate results of non-steady computation. 
[0170] The output data Image processing unit in the twenty-second aspect or the sixty-fifth aspect collects output 
data from the output units of the respective element processors and, to transmit the situation of progress of computation 
to outside, generates visible images of the data in real time based on the output data arriving asynchronously from all 
the element processors. 

[0171] According to the Information processing system of the twenty-second aspect or the method of computation 
performed by using an information processing system of the sixty-fifth aspect, the Information processing system is 
constructed by comprising output data image processing unit for collecting output data from the output I/O unit and 
generating visible images of the output data in real time based on the output date arriving asynchronously from ail the 
element processors and can output intermediate results of non-steady computation without disturbing the operation 
of the element processors, and this has the effect of performing high-speed processing while outputting intermediate 
results of non-steady computation. 

[0172] The coordinate conversion unit in the twenty-third aspect or the sixty-sixth aspect can use different coordinate 
systems i.e. coordinate system suitable to input and coordinate system suitable to computation by converting from the 
first coordinate system used by the host computer to the second coordinate system used by the element processors, 
and converting from the second coordinate system used by the element processors to the first coordinate system used 
by the host computer, and thus facilitates the formation of lattice. 

[0173] According to the information processing system of the twenty-third aspect or the method of computation per- 
formed by using an information processing system of the sixty-sixth aspect, the information processing system is pro- 
vided with coordinate conversion unit for converting, when transferring data from the host computer controlling the 
element processors to the element processors, from the first coordinate system used by the host computer to the 
second coordinate system used by the element processors, and converting, when transferring data from said element 
processors to the host computer, from the second coordinate system used by said element processors to raid first 
coordinate system used by said host computer, and this has the effect of facilitating formation of lattices and Improving 
the accuracy of the solution. 

[0174] The first convergence judging units in the twenty-fourth aspect or the sixty-seventh aspect are provided in 
each of the element processors and independently of the processing units forjudging convergence without disturbing 
the computation of the processing unit concerned. Moreover, the plural number of element processors are divided into 
a plural number of groups and the first convergence judging unit of the respective element processors are connected 
in the form of a unidirectional closed loop or in an integrated form In the respective groups by means of first commu- 
nication routes, and they are also connected to the respective first communication routes of each group by means of 
second communication routes, so as to connect all of the second convergence judging units integrating the results of 
judgement of convergence of all first convergence judging units In the respective groups and all of the second conver- 
gence judging units of the respective groups In the form of a unidirectional closed loop or in an integrated form, thus 
making it possible to form judgement of convergence without disturbing the computation In the respective element 
processors and transmit that information to the host computer through the first and the second communication routes. 
[0175] According to the information processing system of the twenty-fourth aspect or the method of computation 
performed by using an information processing system of the sixty-seventh aspect, the processing unit can perform 
judgement of convergence In parallel for each group without disturbing the processing In the element processors, and 
this has the effect of increasing the computation speed. 

[01 76] The plural number of element processors in the twenty-fifth aspect or the sixty-eighth aspect first form judge- 
ment of convergence to either the respective element processors or to no less than one processing means assigned 
to the processors. And, if Judgement of convergence is made, the respective element processors send out a message 
Indicating convergence of either the element processors or the processing means concerned after receiving a message 
of convergence from the adjacent element processors. By repeating those operations, the element processors transmit 
messages one after another according to the route and judge convergence of the whole system. 
[0177] According to the Information processing system of the twenty-fifth aspect or the method of computation per- 
formed by using an Information processing system of the sixty-eighth aspect, the information processing system com- 
prises e process for forming judgement of convergence for each of the element processors or for each of no less than 
one processing means assigned to the processors, and a process forjudging convergence by transferring message 
one after another according to the prescribed route, and the respective element processors can execute operations in 
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parallel without being disturbed in their respective operations, and this has the effect of increasing the operation speed. 
[0178] After the completion of convergence Is judged by the process judging the convergence of the whole part In 
the sixty-ninth aspect, it becomes possible, by repeating the respective processes, to form similar judgement of con- 
vergence again and, even In the case of existence of any unstable lattice points, judge twice with a time lag so as to 
ensure the Judgement of convergence of the entire system. 

[0179] According to the method of computation performed by using an information processing system of the sixty- 
ninth aspect, the convergence Judging unit forms similar judgement of convergence again by repeating the respective 
processes alter the completion of general convergence has been Judged by the processes which judge the general 
convergence, and this has the effect of correcting forming judgement of convergence of the entire information process- 
ing system even in the present of any unstable lattice points. 

[0180] By confirming convergence In multiplex ways by messages transferred one after another according to the 
plural number of routes in the twenty-sixth aspect or the seventieth aspect, it becomes possible to form similar judge- 
ment of convergence at least twice for one lattice point even in the case of existence of any unstable lattice points and 
ensure the Judgement of convergence of the entire system. 

[0181] According to the information processing system of the twenty-sixth aspect or the method of computation 
performed by using an information processing system of the seventieth aspect, convergence is confirmed in multiplex 
ways by messages transferred one after another according to a plural number of routes, and this has the effect of 
correcting forming Judgement of convergence of the entire information processing system even in the present of any 
unstable lattice points. 

[0182] By newly confirming convergence after the completion of convergence of the entire system is Judged by the 
process judging the convergence of whole system in the seventy-first aspect, it becomes possible to form judgement 
of convergence at a time even in the presence of any unstable lattice points so as to ensure the judgement of conver- 
gence of the entire system, though this leads to once stopping the processing of the entire element processors. 
[01 83] According to the method of computation performed by using an information processing system of the seventy- 
first aspect, the convergence Judging unit forms judgement of convergence again by stopping the processing of all 
element processors or the processing means, after the completion of general convergence has been judged by the 
processes which judge the general convergence, and this has the effect of correcting forming judgement of convergence 
of the entire information processing system even in the present of any unstable lattice points. 
[01 84] The storing means in the seventy-second aspect judges, after being Initialized by a process of Initializing the 
storing means for storing the results of judgement of convergence, convergence or not of the computation for lattice 
points for the respective lattice points taken charge by the processors after passage of the prescribed time In which 
the entire system Is believed to have sufficiently converged and, if it is judged that convergence has not yet been 
reached, sets the storing means to the prescribed logical value and judges convergence by checking the content of 
the memory in the storing means, thus making ft possible to cope even with cases where the system is continuously 
unstable. 

[0185] According to the method of computation performed by using an information processing system of the seventy- 
second aspect, the Information processing system Is constructed by comprising a process of.inltializJng the storing 
means for storing the results of judgement of convergence, and a process of judging convergence or not of the com- 
putation for lattice points for the respective lattice points taken charge by the processors and, if fit Is judged that con- 
vergence has not yet been reached, sets the storing means to the prescribed logical value, and this has the effect of 
maintaining the system workable even in a case where the system gets in a state of constant Instability and thus 
improving the reliability of the information processing system. 

[01 86] The input/output control unit of the seventy-third or the seventy-fifth aspect of the present invention switches 
the direction of communication according to a demand for access to data memory of an element processor, and the 
element processor can read out/write data at high speed from/in one data memory of the plural number of data memories 
shared with the adjacent element processor. This readout data are data written by the adjacent element processor, 
and the data communication between the adjacent element processors is held with access to the data memory. 
[0187] According to the Information processing system of the seventy-third aspect or the method of computation 
performed by using an Information processing system of the seventy-fifth aspect, the I/O controller Is commonly con- 
nected to a plural number of data memories shared by the adjacent element processors and is constructed in a way 
to control input/output of data with one of the plural number of date memories by switching the direction of communi- 
cation according to demand for access, end this has an effect of increasing the speed of data communication between 
adjacent element compressors. 

[0188] The value of the logical product generated by the wired OR circuit in the seventy-fourth or the seventy-sixth 
aspect of the present invention can tell whether or not the results of judgement of convergence indicating convergence 
of operations in the respective element processors are stored in aD of the plural number of registers. At that time, the 
element processors can continue operations independently of the judgement of convergence by using the results of 
Judgement of convergence stored in the registers. 
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[0189] According to the information processing system of the seventy-fourth aspect or the method of computation 
performed by using an information processing system of the seventy-sixth aspect, the system is constructed in a way 
to generate logical product of the output of a plural number of registers provided respectively in correspondence to a 
plural number of element processors to temporarily storing the results of judgement of convergence of the operations 
8 executed by each of the plural number of element processors by means of a wired OR circuit, this has an effect of 
judging convergence with the value of the logical product generated by the wired OR circuit end shortening the con- 
vergence Judging time while forming judgement of convergence independently of the processing for operations of the 
element processors. 

[0190] The element processors of the seventy-seventh aspect of the present invention can shorten the time required 
10 for the convergence of ell element processors because they shorten the time up to convergence by assigning a smaller 
number of lattice points in the case where they contain unstable lattice points for analysis than in the case there they 
do not contain any unstable lattice points, thus reducing the deviation in the operating time of all element processors. 
[0191] According to the method of computation performed by using an information processing system of the seventy- 
seventh aspect, the system is constructed in a such way that the number of lattice points assigned to element proo- 
fs essors comprising unstable lattice points for analysis in which unstable factors exist is smaller than the number of 
lattice points assigned to element processors not comprising any unstable lattice points for analysis, and this has an 
effect of shortening the time up to convergence by levelling the operating time In the respective element processors. 
[0192] These and other objects, features, aspects and advantages of the present invention wfll become more ap- 
parent from the following detailed description of the present invention when taken in conjunction with the accompanying 
20 drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0193] 

26 

Fig. 1 Is a block diagram showing the construction of the information processing system according to the first 
embodiment of the present invention; 

Fig. 2 Is a block diagram showing the hardware construction of an element processor according to the second 
embodiment of the present Invention; 
30 Fig. 3 Is a block diagram showing the hardware construction of an element processor according to the third em- 

bodiment of the present invention; 

Fig. 4 Is a block diagram showing the hardware construction of an element processor according to the fourth 
embodiment of the present Invention; 
Fig. 5 Is a data graph showing the basic flow of operations; 
35 Rg. 6 is a block diagram showing the construction of the information processing system for performing constructed 

three-dimensional computation by three-dimensional ly connecting two-dimensional boards, according to the fifth 
embodiment of the present Invention; 

Rg. 7 Is a flow chart showing the operation algorithm by asynchronous computation system according to the sixth 
embodiment of the present invention; 
40 Rg. 8 Isa conceptual drawing for explaining the control volume based on finite difference method; 

Rg. 9 is a drawing showing the construction of the data handled by the data memory; 

Fig. 10 is a flow chart showing the computation procedure of the computation algorithm by asynchronous compu- 
tation system; 

Rg. 11 is a flow chart showing the computation procedure of the computation algorithm by asynchronous compu- 
45 tatlon system; 

Rg. 12 a block diagram showing the construction of two-dimensional boards construction in pipeline in the direction 
of time base according to the seventh embodiment of the present invention; 

Fig. 13 is a block diagram showing the construction erf the information processing system according to the eighth 
embodiment of the present Invention; 
so Rg. 14 is a block diagram showing the construction of the element processor constituting the information processing 

system according to the eighth embodiment of the present invention; 

Rg. 15 Is a block diagram showing the construction of the Information processing system according to the ninth 
embodiment of the present invention; 

Rg. 16 Is a block diagram showing the construction of the element processor used for realizing the information 
processing system according to the ninth embodiment of the present invention; 

Fig. 17 is a block diagram showing the construction of the information processing system according to the tenth 
embodiment of the present invention; 

Fig. 18 is a block diagram showing the construction of the element processor used for realizing the Information 
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processing system according to the tenth embodiment of the present invention; 

Fig. 19 is a block diagram showing the construction of the memory used for realizing the Information processing 
system according to the tenth embodiment of the present invention; 

Fig. 20 la a block diagram showing the construction of the element processor according to the eleventh embodime rrt 
of the present Invention; 

Fig. 21 is a block diagram showing the construction of the element processor according to the twelfth embodiment 
of the present invention; 

Fig. 22 is a system flow chart for explaining the construction of the processing unit according to the thirteenth 
embodiment of the present invention; 

Fig. 23 is a block diagram showing the construction of the information processing system according to the fourteenth 
embodiment of the present Invention; 

Rg. 24 is a block diagram showing the construction of the element processor used for realizing the information 
processing system according to the fifteenth embodiment of the present invention; 

Fig. 25 is a block diagram showing the construction of the information processing system according to the sixteenth 
embodiment of the present invention; 

Fig. 26 is a block diagram showing the construction of the information processing system according to the seven- 
teenth embodiment of the present invention; 

Fig. 27 is a block diagram showing the construction of the element processor used for realizing the information 
processing system according to the sixteenth and seventeenth embodiments of the present invention; 
Fig. 28 Is a block diagram showing the construction of the Information processing system according to the eight- 
eenth embodiment of the present invention; 

Fig. 29 is a block diagram showing the construction of the information processing system according to the nine- 
teenth embodiment of the present invention; 

Fig. 30 is a block diagram showing the construction of the information processing system according to the thirtieth 
embodiment of the present invention; 

Fig. 31 Is a plan view showing the state of lattice points for explaining the construction of the information processing 
system according to the twenty-first embodiment of the present invention; 

Fig. 32 Is a drawing showing the layout of the program memory for explaining the Information processing system 
according to the twenty-second embodiment of the present invention; 

Fig. 33 is a drawing showing the layout of the program memory for explaining the information processing system 

by dynamic data flow system according to the twenty-third embodiment of the present Invention; 

Rg. 34 is a conceptual drawing showing an example of colour acquisition function for acquiring different data 

memory addresses for respective colour Identifiers in the twenty-third embodiment of the present invention; 

Fig. 35 is a conceptual drawing showing an example of layout of colour identifiers for realizing colour acquisition 

function; 

Rg. 38 is a drawing showing an example of the construction of the data memory for realizing colour acquisition 
function; 

Rg. 37 is a drawing showing the construction of the data memory for realizing the information processing system 
according to the twenty-fourth embodiment of the present invention; 

Rg. 38 Is a flow chart showing the procedure of communication between lattice points according to the twenty- 
fourth embodiment; 

Rg. 39 is a drawing showing the construction of the data memory for realizing the information processing system 
according to the twenty-fifth embodiment of the present invention; 

Rg. 40 Is a flow chart showing the procedure of communication between lattice points according to the twenty- 
fifth embodiment; 

Rg. 41 Is a conceptual drawing of the computation area showing the memory area in the respective directions for 
making communication for each lattice point; 

Fig. 42 is a drawing for explaining the communication between lattice points according to the twenty-fifth embod- 
iment; 

Rg. 43 Is a drawing showing the construction of the data memory for realizing sharing of data area for dummy 
communication according to the twenty-sixth embodiment of the present invention; 
Rg. 44 Is a flow chart for realizing sharing of data area for dummy communication; 

Fig. 45 is a conceptual drawing for explaining the type of communication between element processors according 
to the twenty-seventh embodiment of the present invention; 

Rg. 46 Is a layout drawing for explaining simplification of program according to the twenty-eighth embodiment of 
the present invention; 

Rg. 47 is a drawing showing the relation of lattice points for explaining the actions of the information processing 
system according to the twenty-ninth embodiment of the present invention; 
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Fig. 48 is a block diagram showing the construction of the information processing system according to the thirty- 
first embodiment of the present invention; 

Fig. 49 is a block diagram showing the construction of the information processing system according to the thirty- 
second embodiment of the present invention; 

Fig. 60 is a block diagram showing the construction of the element processor used for realizing the information 
processing system according to the thirty-third embodiment of the present invention; 

Rg. 51 is a block diagram showing the construction of the information processing system according to the thirty- 
third embodiment of the present invention; 

Rg. 52 is a block diagram showing the construction of the information processing system according to the thirty- 
fourth embodiment of the present invention; 

Rg. 53 is a block diagram showing the construction of the element processor used for realizing the information 
processing system according to the thirty-fifth embodiment of the present invention; 

Rg. 54 is a block diagram showing the construction of the Information processing system according to the thirty- 
fifth embodiment of the present Invention; 

Fig. 55 is a conceptual drawing showing the route of transfer of end flag for explaining the actions of the information 
processing system according to the thirty-sixth embodiment of the present invention; 

Fig. 68 is a flow chart for explaining the procedure of transfer of end flag to adjacent lattice point of the information 
processing system according to the thirty-sixth embodiment of the present invention; 

Rg. 57 is a flow chart for explaining the procedure of transfer of end flag to adjacent lattice point of the information 
processing system according to the thirty-sixth embodiment of the present Invention; 

Fig. 58 Is a drawing showing the construction of the message packet used for outputting result data according to 
the thirty-sixth embodiment of the present Invention; 

Rg. 69 is a conceptual drawing showing the route of transfer of end flag for explaining the actions of the information 

processing system according to the thirty-seventh embodiment of the present invention; 

Fig. 60 is a flow chart for explaining the procedure of transfer of end flag to adjacent lattice point of the information 

processing system according to the thirty-seventh embodiment of the present invention; 

Rg. 61 is a flowchart showing the procedure of sending out of end flag of the element processor serving as starting 

point for the transfer of end flag of the information processing system according to the thirty-seventh embodiment 

of the present Invention; 

Rg. 62 is a flow chart for explaining the procedure of transfer of end flag to adjacent lattice point of the information 
processing system according to the thirty-eighth embodiment of the present invention; 
Rg. 63 Is a flowchart showing the procedure of sending out of end flag of the element processor serving as starting 
point for the transfer of end flag of the information processing system according to the thirty-eighth embodiment 
of the present invention; 

Rg. 64 is a flow chart showing the procedure of arbitration of processing at the lattice points of the information 
processing system according to the thirty-eighth embodiment of the present invention; 
Rg. 65 is a flow chart for explaining the procedure of transfer of end flag to adjacent lattice point of the information 
processing system according to the thirty-ninth embodiment of the present Invention; 

Fig. 66 is a flow chart for explaining the procedure of transfer of end flag to adjacent lattice point of the information 
processing system according to the thirty-ninth embodiment of the present Invention; 

Rg. 67 is a flow chart showing the procedure of judgement of convergence of the information processing system 
according to the fortieth embodiment of the present Invention; 

Rg. 68 is a block diagram showing the construction of a system for solving differential equations using a conven- 
tional multi-processor. 

Fig. 69 is a graph showing the efficiency of parallel operations in the case of use of the conventional method of 
solving differential equations; 

Fig. 70 is a flow chart showing the computation procedure of numerical analysis of fluid by SIMPLE method; 
Rg. 71 Is a block diagram for explaining units of the network constructed with I/O controller and data memories 
according to the forty-first embodiment of the present Invention; 

Rg. 72 is a block diagram forexplaining other units of the network constructed with I/O controller and data memories 
according to the forty-first embodiment of the present invention; 

Fig. 73 Is a block diagram for explaining units of the first network constructed with I/O controller and data memories 
according to the forty-first embodiment of the present invention; 

Fig. 74 Is a block diagram for explaining units of the second network constructed with I/O controller and data 
memories according to the forty-first embodiment of the present Invention; 

Fig. 75 is a block diagram for explaining units of the third network constructed with I/O controller and data memories 
according to the forty-first embodiment of the present invention; 

Rg. 76 is a block diagram for explaining units of the fourth network constructed with I/O controller and data mem- 
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ories according to the forty-first embodiment of the present invention; 

Fig. 77 Is a conceptual drawing indicating an outline of the construction of the convergence judging system ac- 
cording to the forty-second embodiment of the present invention; 

Fig. 78 Is a conceptual drawing indicating form of division of the computation area Including unstable lattice points 
* for analysis according to the forty-third embodiment of the present invention; and 

Fig. 79 is a block diagram indicating a model of data transfer route between registers according to the forty-fourth 
embodiment of the present invention. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

10 

[0194] The first embodiment of the present Invention will be described hereafter by using drawings. 
[0195] Fig. 1 1s a general construction drawing of the multi-prooessor in the first embodiment ofthe present invention. 
In Fig. 1, 1 18 the host computer Integrally controlling the computation processing, 2 is the main body of Information 
processing system performing computation processing under the control ofthe host computer 1 , 3 are element proo- 
fs essors arranged spatially end constituting the information processing system 2, 4 is a network for exchanging infor- 
mation by connecting between the host computer 1 and the information processing system 2, 5 are processing units 
provided In the respective element processors 3 for performing computation, 6 are memories provided in the respective 
element processors 3 for storing information, and 7 are connections among element processors for communicating by 
connecting between the respective element processors 3. 
20 [0196] In the Information processing system 2 In Fig. 1 , the state of connection among the respective element proc- 
essors 3 Is Indicated for the case of near-neighbor meshes arranged in the form of lattices, but may also be of other 
connecting systems such as linear array, ring, star, tree, systrtc array, completely connected and the Dke. 
[0197] Next, explanation will be given on actions. 

[0198] The host computer 1 , which faces to the Information processing system 2 through the network 4, performs 
2* setting of initial conditions and boundary conditions, Judgement of convergence inside the information processing sys- 
tem 2, output of results of computation In the Information processing system 2, etc The respective element processors 
3, which are connected to one another so that they may mutually communication through the connection 7 between 
element processors, either set Initial conditions and boundary conditions, send out information for Judgement of con- 
vergence of computation or send out results of computation, according to the control command sent from the host 
30 computer 1 to the information processing system 2. As host computer 1, personal computer or work station, etc. is 
used and connected to the Information processing system 2 through the network 4. 

[0199] The respective element processors 3, which are composed of processing unit 6 and memory unit 6, have the 
functions of storing the initial conditions and boundary conditions sent from the host computer 1 through the network 
4 and the connection 7 between element processors in the memory unit 6, performing the prescribed computation 
35 processing in the processing unit 5 immediately after receipt of token for starting computation from the host computer 
1, write the processed data in the memory unit 8 and further continue computation. 

[0200] The element processors 3 are classified in the 3 major categories according to the construction system of the 
hardware. Fig. 2 to Fig. 4 are block diagrams showing examples of hardware construction system of various kinds of 
element processors 3a to 3c. 

<0 [0201] In the respective drawings, 8 Is a data memory provided In the memory unit 6, 9 is a data bus in processor 
for communicating by connecting between the processing unit 6 and the data memory 8, 10 is a floating point arithmetic 
element provided In the processing unit 5, 11 is a program memory provided in the processing unit 5, 12 is a function 
module provided in the processing unit 5, 13 is a hardware circuit in the shape of a data flow graph according to the 
wiring logic provided in the processing unit 5, 14 are computing elements arranged according to the data flow graph 

4* 13, and 16 is a data line connecting the computing elements 14 according to the data flow graph 13. 

[0202] By realizing all operations necessary for solving differential equations with the hardware, it becomes possible 
to realize the maximum degree of parallel processing on the command level. In the solving method with multi-processors 
based on domain decomposition method, the parallel processing on the command level of the respective processors 
depends on the architecture ofthe processors but Is not based on the parallel processing owned by the solving method 

so itself of the differential equations. Fig. 2 Is a construction drawing of an element processor which realized the flow of 
data processing with a data flow graph based on the wired-logic. 

[0203] Explanation will be given on the second embodiment of the present Invention by using drawings. Fig. 2 Is a 
block diagram showing an element processor constituting the information processing system by the second embodi- 
ment of the present invention. In Fig. 2, inside the processing unit 5, the respective arithmetic-logic elements are 
s* arranged along the data flow graph 13 according to the wl red-logic Indicated in Fig. 5 and connected by means of the 
data One 15, for example. Fig. 5 is a data flow graph showing the flow of the basic processing. In the drawing, 16 are 
arithmetic nodes which perform the prescribed operation to the data input along the arc 17 and output the results of 
operation to the next arithmetic nodes along the arc 17. 
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[0204] The arithmetic-logic elements 1 4 have the functions of performing, for example, four arithmetical operations, 
logical operation, readout & writing command of memory, absolute value, exponential operation, etc. 
[0205] If the token (a message calling for node address or start of computation) sent from the host computer 1 through 
the network 4 gets Into the processing unit 5 through the data bus 9 Inside the processor, element processor 3a have 

s access first to the data memory 6 such as DRAM (dynamic random access memory), SRAM, flash memory, etc., for 
example, and prepare the necessary data by absorbing the quantity of state of their own processing unit 5 stored in 
the data memory 8 and the quantity of state of the adjacent processing unit 5. The respective data pass through the 
sequence of arithmetic-logic elements 14 one after another from the top of the data flow graph 13 to be submitted to 
computation processing and automatically reach the final arithmetic-logic element 14. 

w [0206] When there are completed a series of operations as the respective data pass through the sequence of arith- 
metJc-loglc elements 14 one after another from the top of the data flow graph 1 3 to be submitted to com putatlon process- 
ing end automatically reach the final computing element 14. The arithmetic-logic elements 14 write the results of com- 
putation In the data memory 8 through the data bus 9 Inside the processor and also write them in the data memory 6 
of the adjacent element processor 3a through the connection 7 between element processors to complete the series 

is of computation processing. In that case, the processing unit 5 acts as a special purpose computer which executes only 
the series of processings described in the data flow graph 13, thus providing a high computing speed. 
[0207] Next, explanation will be given on the third embodiment of the present invention by using drawings. 
[0208] Fig. 3 is a block diagram showing an element processor constituting the information processing system by 
the third embodiment of the present invention. By arranging a reloadable program memory in the respective element 

20 processors, It becomes possible to provide them with flexibility in their function of special purpose computers. The 
processing unit 5 Indicated in Fig. 3 is constituted with a floating point arithmetic element 10 and a program memory 
11 Instead of a sequence of arlthmetlo-logic elements 14 according to the data flow graph 13. The operations are 
performed by farther loading the program sent from the host computer through the network 4 to the program memory 
11 of the respective element processors 1b through the connection 7 between element processors. 

25 [0209] The content of the program memory 11 is reloadable and, therefore, it is possible to change the program by 
loading a new program in the program memory 11 inside the processing unit 5 from the host computer through the 
network 4. Therefore, any change of algorithm lor solving differential equations or supplementation of function can be 
made with simple loading of new program In the program memory 11. 

[0210] As It has been described above, even a multi-processor of a construction specialized for increasing the op- 
30 erating speed can be provided with flexible functions of change of algorithm, supplementation of function, eta with 
rewriting of the program. According to the processing unit indicated in Fig. 3, it becomes possible to freely select and 
change the solution algorithm and thus greatly expand the scope of applications of computer compared with the 
processing unit of Fig. 2, by changing the program. 

[0211] By providing a reloadable program memory in the element processors, It becomes possible to change the 
35 program by rewriting the content of the program memory and reaOze change of algorithm or scheme for solving differ- 
ential equations or supplementation of functions, etc. on one same hardware. Fig. 1 and Fig. 3 indicate respectively 
the construction of an Information processing system and the construction of an element processor for loading new 
algorithm, scheme or model In the program memory of the respective element processors In the first embodiment of 
the present invention. 

40 [0212] In Fig. 3, the content of the program memory 11 is reloadable and, therefore, It Is possible to change the 
program by loading a new program in the program memory 11 inside the processing unit 6 from the host computer 1 
through the network 4. The respective element processors 3b are constructed in a way to perform operations by op- 
erating the floating point computing element 10 according to the software written in the program memory 11. If any 
program written with new algorithm, scheme and model to the element processors 3b is Input from the host computer 
1 p the program memory 11 on all the element processors is reset, the program is assigned to the respective element 
processors 3b through the network 4 and the connection 7 between element processors, the program in the program 
memory 11 is rewritten and, upon completion of this procedure, the respective element processors 3b get ready for 
starting computation. 

[0213] As explained above, even a multi-processor of a construction specialized for Increasing the operating speed 
60 can statically reconstruct model, algorithm and scheme to have flexible functions with rewriting of the program. 

[021 4] By providing a data memory the content of which is erasable with command from host computer in the element 
processors, it becomes possible to change the boundary conditions by rewriting the content of the data memory and 
realize computation of various analytical models on one same hardware. 

[0216] In Fig. 1, the respective element processors 3 can be constructed in a way to perform operations by either 
s* having a hardware circuit in the shape of a data flow graph 13 linked by data line 15 in the arithmetlo-logic element 14 
or operating the floating point computing element 10 according to the software written In the program memory 11. 
Moreover, en initial condition setting program is loaded on the host computer 1, and boundary conditions can be set 
freely by using this setting program. If any boundary conditions of computation are input in the host computer 1, the 
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initial condition setting program is started, the data memory 8 on all the element processors is reset, the initial conditions 
and boundary conditions of computation are assigned to the respective element processors 3 through the network 4 
end the connection 7 between element processors and, upon completion of this procedure, the respective element 
processors 3 get into the stale of standby for starting computation. 
* [0216] Therefore, operations to various boundary conditions and initial conditions can be realized with simple pro- 
cedure of loading conditions in the host computer, by using an information processing system 2 of one same multi- 
processor construction. 

[0217] Next, explanation will be given on the fourth embodiment of trie present invention by using drawings. 
[0216] Fig. 4 is a block diagram indicating the element processors constituting the information processing system 

10 by the fourth embodiment of the present invention. The processing unit 5 indicated in Fig. 4 has composite functions 
of the processing unit constructions indicated in Fig. 2 and Fig. 3. In the processing unit 5 indicated in Fig. 4, operation 
processing functions frequently used for solving differential equations are constructed in the form of hardware circuits 
in a fonctton module 12. When performing computation, the (Unction module 12 is called out as required for functioning, 
by describing a command for calling function module 12 in the program. Therefore, the computation time during that 

16 period can be sharply reduced compared with a case in which the software makes the floating point arithmetic element 
10 function for ell the computation processing, as wfth the processing unit indicated in Fig. 3. 
[0219] The respective element processors 3c are connected with adjacent element processors 3c in the form of 
lattice by means of the connection 7 between element processors for data communication, and data communication 
between mutually adjacent element processors 3c is held through this connection 7 between element processors. 

20 Moreover, a network 4 is provided between aO the element processors 3c or peripheral element processors 3c and the 
host computer 1 , to have communication of either convergence judging flag or results of computation. 
[0220] Next, the fifth embodiment of the present invention will be explained by using Fig. 6. 
[0221] By three-dlmensionally connecting two-dimensional computing boards constituted with element processors 
3 arrange on a two-dimensional plane indicated in the first embodiment, it is possible to make extension to three- 

26 dimensional computation. Fig. 6 is a drawing showing the construction of element processors for performing three- 
dimensional computation by trtree-dimensionally connecting two-dimensional computing boards. 
[0222] I n the drawing, 1 is the host computer, 3 are three-dimensionally arranged element processors, 5 are process- 
ing units provided In the element processors 3, 6 are memories provided In the element processors 3 for storing Infor- 
mation, 4 is a network and 18 are two-dimensional computing boards constructed by two-dimenslonally connecting 

30 element processors 3. 1 n this case, unlike the construction in pipeline in the direction of time base of the two-dimensional 
computing boards 18, the network 4 provided between the two-dimensional computing boards 18 is used for the com- 
munication of data for spatial integral calculus. Moreover, the state of connection in the form of three-dimensional array 
between element processors 3 can also be freely changed into lattice type topology, hypercube or pipeline, etc. with 
a change in the wiring of the network 4. 

35 [0223] While, In the above embodiment explanation was given on a case of a near-neighbor mesh where the state 
of connection between element processors 3 is arranged in the form of lattice, the same effects can also be obtained 
even with other connecting systems such as chordal ring, 3-cube, binary 4-cube, etc, for example. 
[0224] Next, the sixth embodiment of the present Invention will be explained by using drawings. 
[0225] Explanation will be given on the method of use of the information processing system 2 of the construction 

40 indicated In the embodiments given earlier. 

[0226] Fig. 1 and Fig. 6 indicate an information processing system of multi-processor construction for sending out 
initial conditions and boundary conditions to respective element processors in the first embodiment of the present 
invention. 

[0227] The information processing system 2 is constructed in a way to perform operations by either having, In the 
45 respective element processors 3b, a hardware circuit in which computing elements 1 4 are connected by means of data 
line 16 in the shape of a data flow graph 13, or operating trie floating point computing element 10 according to the 
software written in the program memory 1 1 as shown in Fig. 3 and Fig. 4. If any initial conditions end boundary conditions 
of computation to element processors are input in the host computer 1 , the data memory 8 on aO the element processors 
3 is reset, the initial conditions and boundary conditions of computation are assigned to the respective element proc- 
60 essors 3 through the network 4 and the connection 7 between element processors and, upon completion of this pro- 
cedure, the respective element processors 3 get into the state of standby for starting computation. 
[0228] Therefore, In the information processing system 2, operations to various boundary conditions end initial con- 
ditions can be realized with simple procedure of inputting conditions in the host computer, by using one same multi- 
processor construction. 

65 [0229] As example of use of the Information processing system 2, explanation will given on numerical analysis of 
fluid based on the domain decomposition method. A fluid which is a continuum is generally described with an equation 
of continuity and an equation of strong non-linearity which Is called Navfer-Stokes equation. When solving the governing 
equation of a field described with those partial differential equations by the finite difference method, first the object 
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phenomenon and area are divided into the smallest elements (lattices) having a fine size in both space and time. On 
these lattices, a solution Is obtained by transforming the governing equation expressed with partial differential equations 
Into a descretized difference equation and solving this difference equation. While, in the method of solution by finite 
difference method, the quantity of state owned by fine space elements are Influenced only by the quantity of state 
owned by elements existing In the Immediate neighborhood, this effect Is produced, In computation, by making com- 
putation processing to the quantity of state owned by adjacent elements. Namely, it is seen that small elements which 
are separate In space from one another do not have any influence one upon another and can be calculated independ- 
ently by just taking the influences of adjacent small elements only into consideration. 

[0230] For example, consider solving an Incompressible fluid by SIMPLE (Semi-Implicit Method for Pressure-Linked 
Equations) method (method which consists in solving equation of motion under a hypothetical pressure and obtaining, 
for the portion not satisfying continuity, a solution eventually satisfying the rule of continuity while correcting pressure 
and flow velocity as described in S.V. Patenter, Numerical Heat Transfer and Fluid Flow(1980), McG RAW-HILL BOOK 
COMPANY, for example). 

[0231] Rg. 70 is a flow chart showing the calculation procedure conventionally used for making numerical analysis 
of fluid by SIMPLE method. In the drawing, 211 indicates a group of routines for solving Navier-Stokes equations of 
motion, 212isagroupofro utines for solving the pressure-correction equation, 21 3 is a routine for performing correction 
of flow velocity and pressure, 214 is a repetition cycle for non-linear convergence, and 215 represents a computation 
cycle for advancing time. Namely, the procedure of thermal hydraulic analysis by SIMPLE method consists in solving 
discrete Navier-Stokes equation of motion with the group of routines 211, solving the flow velocity & pressure deter- 
mining equations one after another with the routine 21 3 and also repeating the computing cycle for time marching 215. 
[0232] In the method of parallel solution using multi-processor which is frequently employed these days, the main- 
stream Is the domain decomposition method which consists In dividing the entire computation space Into a plural 
number of areas, assigning the element processors to the respective areas and transmitting data of the quantity of 
state of borders only between the element processors. 

[0233] A problem in this method is waiting for synchronization regarding the communication between element proc- 
essors. In the current parallel processors for general use, in which data communication between element processors 
is realized with a low-speed remote memory access by packet communication, etc., It is necessary to have a technique 
of using computation algorithms of coarse grain with least possible communication amount to the number of the element 
processors and avoid waiting for synchronization as much as possible. However, there is no way to avoid sudden 
increase of waiting for synchronization in the case where the element processors are constructed in super parallel and, 
therefore, no parallel proportional to the number of processors can be expected. 

[0234] For that reason, a new SIMPLE method in which the lattice points individually calculate the quantity of state 
will be indicated, with special attention to the fact that the solution obtained is sufficiently accurate even if the equation 
of motion and the pressure-correction equation are solved by referring to the quantity of state of the adjacent lattice 
points only. 

[0235] Fig. 7 is a flow chart showing the computation algorithm with which the lattice points #1, #2, ... #n in this 
embodiment calculate the quantity of state independently without making any Interrupt operation In the computation 
of other lattice points. 

[0236] In the drawing, 20 is a group of routines for solving Navier-Stokes equation of motion, 21 is a group of routines 
for solving the pressure -correction equation, 22 is a routine for performing correction of flow velocity and pressure, 23 
is a repetition cycle for non-linear convergence, 24 is a computation cycle for time marching, and 26 is a group of 
routines of element processors assigned to the respective lattice points #1, #2, ... #n. 

[0237] Here, a concrete algorithm will be indicated by taking numerical analysis of fluid of an incompressible two- 
dimensional layer current. When no temperature field is taken into account, a fluid is described with the following 3 
nondlmenslonal equations: 
[0238] Continuous equation is given in Formula 1 . 




(1) 





(3) 
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[02391 Here, the respective symbols represent following: 

u = velocity component In direction x; v = velocity component in direction y; p = pressure; Re = Reynolds number 
[0240] Formula 1 to Formula 3 can be expressed uniformly with Formula 4, in which $ represents an optional variable, 
r la a diffusion term, and Su, Sp are generation terms and take the values given in Table 1 below. 



(4) 



[Table 1] 



Equation 




r 


Su 


Sp 


Continuity Equation 


1 


0 


0 


0 


Equation of Motion (X) 


u 


1/Re 


-dptdx 


0 


Equation of Motion (Y) 


V 


1 / Re 


-dp/ty 


0 



[0241] Fig. 8 indicates a control volume necessary tor differentiation of Formula 4. An optional variable $ is placed 
at the center point P 27 of the control volume 26, the respective points of adjacent $ are defined as E (East) 28, W 
(West) 29, N (North) 30, S (South) 31 , and their center points (intersections between the border of the control volume 
26 and the lattice line) are indicated with small letters (e, w, n, s). Because they are staggered grids, the scalar quantity 
is placed at the position of capital letters and the vector quantity (velocity) is placed at the position of small letters. If 
the Eider explicit method is applied to Formula 4 as scheme for difference of time to make digitizing considering the 
flow velocity coming Into and going out of the control volume 26, Formula 4 changes into Formula 5. 

At AX Ay 

_ (rd$tdx) 9 -(rd*idx) w 

Ax 



+ S " S P*P 



(5) 



[0242] Supposing that <j> is distributed linearly among lattices and that e, w r n, 8 are center points between P and E, 
P and W, P and N, P and S respectively, Formula 6 to Formula 9 can be supposed. 



(6) 

(0 



(8) 



(9) 



[0243] Formula 10 Is obtained by Integrating Formula 5 over the control volume. 



a"* 1 - a 

P At P A*Ay + {u 9 Art 0 - u„Art w ) + (v n Ax$ n - v^Ax^) 
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+ (S u + S p <>,,)AxAy (10) 

10 [0244] Here, definitions will be made as In Formula 11 and Formula 12 regarding convection term C and diffusion 
term D. In those expressions, the suffix letters indicate the position of border face of the control volume 26. 



IB 



26 



C, - u,Ay, C„ = a w Ay, 
C n = v n Ax, C a = v,Ax 



T r 
D. = — i-.Ay, Z> » — — Ay, 

* • (8*)„ 

r r 

D = — 2- Ajc, n a — £_ Ax 
" (8y)„ * (8y), 



.. (H) 



... (12) 



[0245] If the definitions of Formula 11 end Formula 12 ana used, Formula 10 changes to Formula 13. 

SO 

= - <f>p) - O^p - $ w ) + D„(<> w - <> p ) 
40 -*Wp -♦$)♦<$« + S p * p )AxAy (13) 

[0246] If arrangement Is made regarding <j> by applying hybrid scheme as space difference scheme to this Formula 
13, Formula 14 Is obtained. 

46 



60 



83 
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40 



60 



58 



M^' " f Jb + + Wit + tys 

c c 

F, = max [-C e D t - -f, 0], F w = max [C^ + — ^, 0] 
F„ = max [-C„ D K - 5l. 0], F, = max fC„ D, + S 0] 



(14) 



10 ■ . ^ " 2 

= AxAy / At 



fa 

where max {,,} Is a function for selecting the maximum value In the parentheses. 

[0247] As lime difference methods, various methods such as explicit method, implicit method, semi-implicit method, 
etc. However, by evaluating $ p Included In the right member of Formula 1 3 with the value <|ip n+1 of the subsequent time 
aiming at eventually acquiring a steady state solution as an acceleration method, the discretization equation of Formula 
20 15 can be obtained from Formula 14. 

25 whenF p = F e + F w *F„*F a + B n 

G = (S u + S ( fi p )AxAy + a pl fl p (15) 

[0245] Velocity and pressure which ere unknowns of the discretization equation led out by the SIMPLE method will 
30 be related to each other. First, the discretization equation of Formula 2 and Formula 3 of the equations of motion will 
be led out. The variable $ is u , v and, if Formula 16 to Formula 1 9 are defined for velocity component u, the discretization 
equation will become as Formula 20 and, If Formula 21 to Formula 24 are defined for velocity component v, the dis- 
cretization equation will become as Formula 25. 
[0248] Formula 1 6 Indicates the coefficient of w-face regarding speed u. 



K = Ay, x -1—1 



D w = B w x max [0, (1 - 0.5 x |^|)J 



46 W W ^ ' B 



[0250] Formula 17 indicates the coefficient of e-face regarding velocity component u. 
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Kp + «, 

K = Ay, x -E^-i 

D t = B e x max [0, (1 - 0.5 x 
Formula 18 Indicates the coefficient of s-face s regarding velocity component 



v- ♦ v. 

J m bX p X 2 

' e 2 

A?p 

5 a = — 

Z>, = B, x max [0, (1 - 0.5 x |4s|)] 



Formula 19 Indicates the coefficient of n-face regarding velocity component u 



V + V 
A = Sr x _2 E 

■ * 2 
6x p 

5. ? 



D H » B a x max [0, (1 - 0.5 x 



U P F P = u e F e * u w F w *u n F„* u„ F„ + u s F B +Q 
where F m = D w + max (A w , 0) 
F 0 = D 0 + max 0) 
F^D. + max^O) 
F n = D„ + max(-A„.0) 

O = Ay p (P ir -Pp)+8p 0 u p 
Formula 21 Indicates the coefficient of w-face regarding velocity component u 
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Z>„ = B„ x max [0, (1 - 0.5 x 

[0254] Formula 22 indicates the coefficient of e-fece regarding velocity component v. 

15 

u + u 

*• = ~pTT~ - (22) 

0, - B t x max [0, (1 - 0.5 x f^l)] 

[0255] Formula 23 Indicates the coefficient of e-fece regarding velocity component v. 



30 

V- + v. 



* * 2 



35 5, 



Ax 



' *J£ (23> 



D s = 5, x max [0, (1 - 0.5 x j^l)] 

[0256] Formula 24 Indicates the coefficient of n-fece regarding velocity component v. 



A a = Ax. x — 

? 2 

Ax p 

so *" R^y, (24) 

D n = 5. x max [0, (1 - 0.5 x |^|)) 

83 

Vp = v* F„ ♦ v'„ F w ♦ v* F„ + v> 8 + O 
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where F w = D w + max (A w , 0) 
F 9 = D 9 + max (-A 9 , 0) 
F a = D e + max(A a ,0) 
F n = D n + tr*x(-A nt Q) 

Pp s Pw*P 9 *P 9 *P n *^PO 
G = Ax p (P 8 - P p ) + apoVp (25) 

[0257] By soMng Formula 20, an assumed flow velocity u* at point P can be determined. By solving Formula 25, an 
assumed flow velocity V* at point P can be determined. By using the assumed values V* obtained from Formula 20 
and Formula 25 as well as Fp necessary for leading those equations, the coefficient of the pressure-correction equation 
to determine the pressure correction value p\ Formula 26 indicates the pressure-correction equation. The suffix letters 
e and n in the equation Indicate values transferred from the element processors on East side and North side respectively. 

O = Ay p u p - Ay P u pt e + Ax p v p - Ax p v pf n 
whew F w = AypAy p I F p (u) 
F 0 = A yp Ay p IF PQ {u) 
F 8 = AXpAx p IF p {u) 
F n = AXpAXpI F p si(u) 

P' pf=P = P' A + P'j: w + P % nPn + P ' + G (26) 

[0256] The flow velocity and pressure are calculated with the formulas of Formula 27 to Formula 29 by using the 
pressure correction value p' transferred from the element processors on the West side and the South side. 

P P = P P + P' P (27) 
u p = u P AY P I F p {u) X (P' w - P' p ) (28) 

v p = V^Axp / F p (v) X (P' 3 - P' p ) (29) 

[0259] Various kinds of lattice points for computation are conceivable in which discrete points may be generally found 
inside the substance, neighboring the wall or at flow inlet or outlet, etc. Preparing many kinds of computation programs 
(software) and computation circuits (hardware) makes the programming very troublesome. For that reason, the position 
information of the respective points is given in advance to the memory of the respective element processors as initial 
conditions In the form of Internal point & external point bits and boundary conditions, so that one exactly same program 
may be executed for the computation itself. 

[0200] The distinction between internal point and external point of the computation area Is also made by using bits. 
If the discretization point is inside the substance, H will be considered as an external point and, when the discretization 
point te in the fluid, it will be considered as an internal point. The internal point & external point bits are defined for each 
of the variables u, v, p, and the internal point will be given as 1 (on) while the external point wfll be given as 0 (off). For 
example, if the flag for u Is put as bit(u), the following operation processing will be made before deciding the coefficient 
of the discretization equation of u: 
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F p = bit(u)F p + bit(u) 

F w = bit(u)F^ F t - bit{u)Ff 

F, =■ bm)F? F n * bit(u)F n > ... (30) 

Cf = bit(u)G + bit(u)u p 

where bit(u) represents "not bit(u)". 

[0261] As for boundary conditions, the boundary conditions of the kinds given below can be set Each of the variables 
u P v, p has a flag Indicating use or not of wall function. However, the bits are set in the respective directions of w, e, s p 
n to judge on which face the wall exists. Trie operations of Formula 31 to Formula 33 are performed before deciding 
the coefficient of the discretization equation of u and v. 
[0262] Formula 31 indicates the wall face conditions using wall function. 



1 2Ax p 
x = 

F ? = F,- bit(w)F w - bit{e)F t - bh®F s 

- bitQi)F H + [bit(w) + bit(e) 

+ bttjs) + bit(n)]x ni) 
F w = bim F w 

F 6 = We) F t 

F s = bW) F a 

F n * bit(n) F H 

where i represents shear force. 
[0263] Formula 32 Indicates the outlet boundary conditions of gradient 0. 

fa = Afffw)^ + m*h* 9 * btt(*h* 9 ♦ blt(n)^ n (32) 

where <frp1 represents variable having outlet boundary conditions of gradient 0. 
[0264] Formula 33 Indicates fixing of boundary values. 
[0265] Fixing of boundary values, 

+ W(s) 3 (2^-^) 

+ «(»)aP*Mi«r-W (33) 

where ^ represents variable having fixed boundary conditions and <J> bourKj represents a fixed boundary value. 
[0266] The variable i>p can be described as In Formula 34 by combination of Formula 32 and Formula 33. 
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4>p = mm P * WW x ♦ «p3) (34) 

[0267] Aa tor the nature of operations necessary for tfie respective computation cells, almost the same operation Is 
* made though under different boundary conditions, because of the characteristics of governing equation of fluid, if only 
the boundary conditions are mapped initially on the data memory by providing various kinds of bit according to said 
setting of boundary conditions, one same program can be used regardless at which position of border, an Internal point 
or an external point of computation space the computation cell may be. Therefore, just loading the program in which 
to describe a series of operations necessary for calculating the physical value of one lattice point on the program 
10 memory of the element processors 3 will be enough. To perform computation by sweeping the computation lattices 
one after another using that one kind of operation program, first the lattice point number of the computation lattice (or 
base address on the data memory 8) is outlet on the circulating pipeline (84 in Fig. 21) inside the element processor 
3 as Input token. If the lattice point number which Is the Input token Is 2, for example, the computing element will start 
computation by having access to the memory area assigned to the second lattice point. Since there Is no limitation of 
18 order In the Input of tokens, it is also possible to input lattice numbers at random, in addition to just advancing the 
computer sequentially as 1 , 2, 3 ... 

[0268] Fig. 9 is a drawing showing the data construction inside the data memory provided In the element processor. 
As shown in the drawing, the inside of the data memory provided in the element processor Is divided into partitions 32 
in the same number as the number of lattice points to be taken charge by one processor, and the data necessary for 
20 computation of one lattice point Is mapped in the respective partitions. 

[0269] Fig. 10 and Fig. 11 are flow charts showing the flow of the algorithm which has so far been described. In the 
drawings, 33 to 63 are the respective computing functions A to I Indicated respectively hereafter. 

(a) The element processor 3 has a function of determining the predict value u* by solving the equation of motion 
25 in the direction x. In the drawing, 33 is function A (u) of determining the coefficient F w of the equation of motion in 

the direction x In the computation for w-face of u, 34 is function B (u) of determining the coefficient F e of the equation 
of motion In the direction x in the computation for e-face of u, 36 is function C (u) of determining the coefficient F e 
of the equation of motion In the direction x In the computation for s~fece of u, 36 is function D (u) of determining 
the coefficient F n of the equation of motion In the direction x in the computation for n-face of u, 37 is function E (u) 
30 of determining the generation term G of the equation of motion in the direction x, and 43 is function G (u) of 

determining the coefficient F p of the equation of motion in the direction x and determining the predict velocity 
component, u* from the respective coefficients and the generation term. 

(b) The element processor 3 has a function of determining the predict value v* by solving the equation of motion 
in the direction y. In the drawing, 38 is function A (v) of determining the coefficient F w of the equation of motion in 

35 the direction y In the computation for w-face of v, 39 is function B (v) of determining the coefficient F e of the equation 

of motion In the direction y in the computation for e-face of v, 40 is function C (v) of determining the coefficient F a 
of the equation of motion In the direction y In the computation for s-face of v, 41 Is function D (v) of determining 
the coeffdent F n of the equation of motion In the direction y In the computation for n-faoe of v, 42 Is function E (v) 
of determining the generation term G of the equation of motion in the direction y, and 44 is function G (v) of deter- 
mining the coefficient F p of the equation of motion In the direction y and determining the predict velocity component 
V* from the respective coefficients and the generation term. 

(c) The element processor 3 has a function of determining the pressure correction value p\ In the drawing, 51 is 
function F of determining the generation term G of the equation for p\ 52 is function G (p) of determining p' by 
solving the equation for p* on the basis of the generation term obtained with the coefficient of equation of motion 

4* and F. 

(d) The element processor 3 has a function of correcting flow velocity and pressure. In the drawing, 55 Is function 
H of determining u, v, p on the basis of p\ 

(e) The element processor 3 has a function of setting boundary conditions, in the drawing, 45, 46, 57 to 59 are 
functions I of setting boundary conditions based on bit data given in Initial values. 

fio (f) The element processor 3 has a ftjnction of transferring boundary conditions set In the drawing, 47, 48, 53, 60 
to 62 ere functions of transferring the boundary conditions set respectively in the previous processes to the adjacent 
element processors 3. 

(g) The element processor 3 has a function of judging convergence or not In the drawing, 48, 50, 54, 63 are 
functions of judging convergence or not 

53 

[0270] By connecting those functions according to the flow chart indicated in Rg. 11, the computation code for one 
lattice point is completed. 

[0271] Next, the algorithm for computing one lattice point is mapped on the program memory 11 of the element 
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processor 3. The mapped program starts by using as input token the lattice point number information tag sent from 
the host computer 1 through the network 4 and the connection 7 between element processors. In the case where a 
plural number of lattice points are mapped on one element processor 3, the data transferred between the element 
processors 3 through the connection 7 between element processors Is an Information held by the lattice point positioned 
ff at the outermost part of the respective divided areas constituted with a plural number of lattice points. For that reason, 
by directly having access to the data memory on the adjacent element processors 3 through the connection 7 between 
element processors provided between the element processors 3, it becomes possible to perform data transfer between 
processors and make rapid analysis by asynchronous control by eliminating data transfer overhead between lattice 
points. 

10 [0272] Moreover, it is also possible to map one lattice point on one element processor, use at least the same number 
of element processors 3 as the number of computed lattice points, arrange a multiport memory to be described later 
between the element processors 3, construct a super parallel system connecting them with the connection 7 between 
element processors and obtain convergent solution by performing exchange of data without mutually disturbing the 
processing of the element processors 3 while keeping data communication asynchronously between mutually adjacent 

1Q element processors 3. 

[0273] Furthermore, synchronous analysis between element processors 3 also becomes possible by mapping a 
plural number of lattice points on one element processor and adopting synchronous communication system between 
element processors. In this case also, sharp Improvement in the processing speed is possible compared with the 
method synchronous solution using conventional processors for general use, because the processing unit 5 inside the 

20 element processors 3 has a special purpose of high-speed operation suitable for solving partial differential equations. 
[0274] Although explanation was given, in the above embodiment, about a case where the algorithm for computing 
one lattice point Is mapped on the program memory 11 of the element processor 3, the same effects can also be 
obtained even if the computing element 14 is connected with the data line 15 to form a hardware circuit in the shape 
of data flow graph 13, as a matter of course. 

25 [0275] In addition, efficient parallel processing of the communication between mutually adjacent element processors 
3 Is possible when It is a message communication of sending messages to a suitable process by means of address 
symbols of message when sending message (data) from a certain process to another and that a plural number of 
different (may also be same) commands are executed in parallel on a plural number of data. 
[0276] Next, explanation will be given on the seventh embodiment of the present Invention. 

30 [0277] As method of approximation of time differential, there are Euler explicit method, Euler implicit method, second- 
order Crank-Nicholson scheme, third-order Richardson scheme, etc. because of differences In the time steps of quantity 
of state to be referred to. In the third-order Richardson scheme, for example, when the current time is given 88 n, the 
quantity of state necessary for obtaining the value of next time n+1 is determined by the values of time n-1 and n. In 
the method using a conventional Neumann's computer, there was a limitation to the scale of computation due to short- 

30 age of memory capacity, because it was necessary to store the values of time n-1 and n excessively on the common 
memory. 

[0278] Even such problem of shortage of memory can be easily solved with the use of a plural number of two- 
dimensional computing boards constituted by element processors arranged on two-dimensional plane indicated In the 
first embodiment. Fig. 12 is a drawing showing the construction of an information processing system for approximation 
40 of time differential on two-dimensional space. In the drawing, 66 to 67 are two-dimensional computing boards consti- 
tuted respectively by two-dimensional iy connecting element processors 3, 65 Is a two-dimensional computing board 
taking charge of computation of the value corresponding to time n-1, 66 is a two-dimensional computing board taking 
charge of computation of the value corresponding to time n and 67 is a two-dimensional computing board taking charge 
of computation of the value corresponding to time n+1 . Moreover, 4 is a network for exchanging information by con- 
40 nectlng the respective two-dimensional computing boards in a pipeline and also connecting them to the host computer 1 . 
[0279] in the case of Richardson scheme, for example, the values of time n-1 and n necessary for obtaining the 
value of time n+1 is transferred from the data memory of the respective element processors 3 on the two-dimensional 
computing board 65 for time n-1 and the two-dimensional computing board 66 for time n to the respective element 
processors 3 on the two-dimensional computing board 67 for time n+1 through the network 4. When the computation 
so is over, the results of compulation are automatically stored in the data memory on the respective element processors 
3 on the two-dimensional computing board 67. Tb calculate the value of n+2 in succession, reset the data memory on 
the respective element processors 3 on the two-dimensional computing board for time n-1 where the value of time n- 
1 1s stored, newly transfer the values of time n and n+1 respective as value of time n+2 from the two-dimensional 
computing board 66 for time n and the two-dimensional computing board 67 for time n+1 to the two-dimensional com- 
as puting board 65 for time n-1 through the network 4 for computation and store the result as value of time n+2 in the data 
memory on the respective element processors 3. 

[0280] As explained above, according to the information processing system of the third embodiment, the problem of 
shortage of memory Is solved and the operations become speedier. 
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[0281] This construction in pipeline in the direction of time base also contributes to increase of speed in the judgement 
of convergence of two-dimensional analytical solution. In the solution of differential equations, in which the judgement 
of convergence is made when the difference between the calculated value at the current time n and the calculated 
value at the previous time n-1 has become no larger than a certain standard range, the construction In pipeline in the 
* direction of time base makes It possible to judge convergence with a simple action of comparing the value between 
mutually adjacent two-dimensional computing boards. 

[0282] Next, explanation will be given on the eighth embodiment of the present invention. 

[0283] While the main object of application of the present invention is solution of differential equations, this kind of 

problem Is computed by repeating mutual actions between mutually adjacent lattice points and the computation is 

10 completed with convergence of ail lattice points in a steady state. Namely, hardly any exchange of data is made between 
lattice points which are not adjacent to each other and exchange of data must be made only between mutually adjacent 
lattice points. Moreover, as a general practice, after the object system is divided into lattice points, the computation 
regarding one or a plural number of mutually adjacent lattice points is taken charge by one element processor 3 and 
the computation regarding lattice points adjacent to those lattice points is also taken charge by the adjacent element 

15 processor 3. 

[0284] By paying attention to such characteristics of the object problem, it can be understood that the efficiency of 
computation sharply improves and the desired operations can be completed in a short time if mutually adjacent element 
processor 3 are constructed In a way to share one same memory and to directly have access mutually to the memory 
of the other processor. 

20 [0285] Fig. 13 indicates a part of multi-processor In which element processor 3 are connected by the connection 7 
between element processors and arranged In the shape of square lattices. Moreover, Fig. 14 is a drawing showing the 
Internal construction of one piece of element processor indicated in Fig. 13. In the drawing, the processing unit 5 is 
connected wtth the data memory 6 incorporated in the same processor and can have access to this data memory 8. 
Furthermore, this processing unit 5 is connected with adjacent element processors through input/output control circuit 

25 (hereinafter referred to as "I/O 11 ) 71 . 

[0286] By being connected with the adjacent element processor 3d through the I/O 71 , it becomes possible (for the 
processing unit 6) to directly have access to the data memory 8 in the adjacent element processor 3d as If it were a 
memory In one same element processor. Conversely, the adjacent element processor 3d can also have access to the 
illustrated data memory 6 in the same way. 

30 [0287] To mutually have access to the memory between mutually adjacent element processors, the I/O 71 in the 
drawing must discharge the function of arbitration against the demand of memory access from the adjacent element 
processor for the data bus 73 between the processing unit 6 and the data memory 8. However, explanation on this 
point will be omitted here, because arbitration tor data bus is a known technology which is already used in general 
processors and realized with the use of VME bus, for example. 

36 [0288] As explained above, in the element processor, the processing unit 5 is connected with the data memory 6 
incorporated in one same processor so that the processing unit 5 may have access to the data memory and it also is 
connected with the adjacent element processor 3d through i/O 71. This makes it possible to reduce the overhead for 
data communication between element processors 3d and Improve the efficiency of computation. 
[0289] While a general case was explained In the explanation of the construction of element processors given above, 

40 just referring to the data (namely reading) of adjacent lattice points Is enough and no renewal (namely writing) Is 
required as far as solution of differential equations is concerned. It is apparent that, if the operation is limited to referring 
only, there is no risk of collision between requests fbrrenewai to one same memory address and therefore the arbitration 
of data bus for memory access can be simplified. 

[0290] However, it is expected that a plural number of requests for reference to one same memory may occur at a 
feirly high frequency. I n that case, there is a fear that accumulation of processing may be produced eventually requiring 
a longer computation time with the memory of a single board. 7b avoid such problem, It will be all right if a multi-port 
memory Is adopted to enable simultaneous processing of reference to a plural number of memories. 
[0291] Explanation will be given on the ninth embodiment of the present Invention realized for that purpose. 
[0292] Fig. 16 Indicates a case where the memory Is shared with 6 adjacent element processors. In the drawing, 3 

so is an element processor and 7 is a network. 

[0293] Fig. 1 6 indicates the internal construction of one element processor, and the data memory 8 has 7 ports. The 
data memory 8 Is connected with 61/0 71 through data bus 75 and is also connected wtth the processing unit 5. By 
having such construction, it becomes possible to simultaneously handle requests for reference to memory from the 
element processor concerned and 6 adjacent element processors. 

63 [0294] If the operation Is reference only, it can be realized wtth a comparatively simple control circuit However, in 
the case where the information processing system Is for general purposes, there are cases where (the processing unit) 
is requested to also have a function of writing In the memory of adjacent element processors 3e. By having this kind 
of construction, it becomes possible to also write one same data in the data memory 8 of adjacent dement processors 
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3e simultaneously wfth writing in the data memory 8 of the element processor 3e concerned, thus greatly contributing 
to the shortening of computation time In the processing involving collision of references to one same memory address. 
[0295] Still more, it is not always necessary to have the memory unit to be shared inside the element processors. 
Tnoretore, explanation wiO be given on the drawing of the tenth embodiment of the present Invention realized with 

o some modification to this memory unit 

[0296] Fig. 1 7 indicates the construction of an information processing system in which the commonly shared memory 
unit is arranged outside the element processor. Moreover, Fig. 1 8 indicates the construction of an the element processor 
constituting the information processing system Indicated in Fig. 1 7 and the relation of connection with the memory unit. 
In the drawing, 76 is a memory provided outside the element processor 3f, 5 is a processing unit connected with the 

10 external memory 76 by means of I/O 71, 8 is a data memory provided inside the element processor 3f end directly 
connected with the processing unit 5 and Is connected with the processing unit inside the element processor 3 through 
I/O 71. Namely, as shown in Fig. 17, the memory 76 is arranged on the data route between element processors. By 
arranging this way, It becomes possible to directly refer to and renew four memories 76 shared with adjacent element 
processors through I/O 71. 

is [0297] The method indicated hereafter is available as example of a method for exchanging data between mutually 
adjacent element processors 3f through said memory 76. 

[0298] Namely, exchange of data between element processors 3f is completed by stipulating in advance the address 
or scope of address for writing by respective element processors 3f, writing data in the predetermined address area 
of the memory 76 and reading out that address area wfth the element processor 3f on the data receiving side. 

20 [0299] In Fig. 17, the symbol "P" described Inside the memory 76 indicates the scope of address for writing by element 
processor 3, while the symbols "S", D N a , "P, "W indicate the scope of address for writing by element processors 3f. 
[0300] While explanation was given on an example Including data memory 6 in the element processors 3f in this 
embodiment, it is apparent that the same effects can be obtained even by element processors having no data memory 
8 if the data to be stored in the data memory 8 Is stored in the memory 76 connected to the element processor. 

25 [0301] I n the above explanation, ft was given on a case where single-port memory is used as memory 76. However, 
by using multi-port memory also in the case of memories arranged on the data route, It becomes possible to shorten 
the computation time In the same way as the case of incorporating the data memory 8 in the element processors 3f 
explained in Rg. 16. 

[0302] However, even when multi-port memory Is adopted, the processing is Interrupted leading to Increase of com- 
30 putation time in case of collision of requests for renewal for one same memory address. To avoid such problem it is 
effective to Introduce a buffer mechanism for temporarily storing requests for reference to/renewal of memory. Rg. 19 
indicates a multi-port memory Introducing a buffer memory for that purpose and shows only the portion corresponding 
to the memory 76 in the information processing system of Fig. 17. 

[0303] in the drawing, 76a is a mufti port memory with buffer mechanism, 79 is a 2-port memory and 60 is a buffer 
30 mechanism. The buffer mechanism 8 can temporarily store the data received from the 2-port memory 79 through the 
connection 7 between element processors or the data to be written in the 2-port memory 79. 
[0304] As explained above, because a buffer mechanism 80 is provided in each port of the 2-port memory 79 arranged 
between mutually adjacent element processors 3f, there is no fear of interruption of processing even in case of collision 
of requests for renewal for one same memory address. Any increase in computation time resulting from such interruption 
40 of processing can also be avoided . 

[0308] In the mil I tl port memory indicated in Rg. 19, explanation was given on a case where the buffer mechanism 
80 is provided in each port of the 2-port memory 79. However, it Is apparent that the buffer mechanism may also be 
provided for the respective ports of the multiport (memory) and that it is also effective to have one buffer for a plural 
number of or all ports. 

4* [0308] Next, explanation will be given on the eleventh embodiment of the present invention. By providing a buffer 
memory for retaining communication data for each of the element processors, any disturbance to operations of the 
respective element processors due to data communication between element processors can be prevented. 
[9307] Rg. 20 is a block diagram showing the construction of an element processor having a buffer memory for 
retaining communication data. In the drawing, 5 is a processing unit provided In the element processor 3g, 7 Is a data 

so bus connecting between element processor 3g, 82 is a buffer memory provided in the element processor 3g and 
connected to the data bus 7 between element processors and the processing unit 5, and 8 is a date memory provided 
In the element processor 3 and connected to the processing unit 5. 

[0308] The respective element processors 3g write the data sent from the adjacent element processor 3g through 
the data bus 7 between element processors in the buffer memory 62 as data with object name, and search the data 
05 with object at optional timing from this buffer memory to complete reception of data from the adjacent element processor 
3g. According to an information processing system formed with element processors having the construction indicated 
in Fig. 20, faster processing becomes possible without disturbing the operations of the processing unit 5 inside the 
respective element processors 3g or floating point arithmetic element, for example, with data communication between 
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mutually adjacent element processors. 

[0309] Next, explanation will be given on the twelfth embodiment of the present invention. Fig. 21 is a block diagram 
showing an example of the construction of element processor of the information processing system according to this 
twelfth embodiment 

o [031 0] I n the drawl ng , 3h Is an element processor, 1 0 is a floating point arithmetic element provided inside the element 
processor 3h, 83 is a matching memory (data pool) provided inside the element processor 3h t 11 is a program memory 
provided inside the element processor 3h, 8 Is a data memory provided inside the element processor 3h, 84 is a pipeline 
tor exchanging information by connecting the floating point arithmetic element 10, the data memory 8, the program 
memory 11 and the data memory 8, and 7 is a connection between element processors for exchanging information by 

io connecting between element processors. 

[0311] There are a wide variety of data to be exchanged by communication between mutually adjacent element 
processors 3h. In this embodiment, the respective data are given object names enabling discrimination in advance of 
the kind of processing to be made next. A data with object name Is Input In the form of a data packet on the pipeline 
84 provided with the matching memory (data pool) 83, circulates and is sent out to the matching memory (data pool) 

15 83 in side the adjacent element processor 3h through the connection 7 between element processors. The floating point 
arithmetic element 10 searches Inside the matching memory 83 at optional timing and starts operations as soon as 
any data with object name necessary for performing an optional processing is ready. 

[0312] At that time, It is necessary to secure a sufficient storage capacity for the matching memory 83 because the 
programs solving partial differential equations tends to have large degrees of parallel processing. 

20 [0313] The above explanation Involved a case In which the processing unit of the element processor 3h is constructed 
with a floating point arithmetic element 10 in the above description. However the same effects can be obtained even 
with a design In which the arithmetic-logic elements 14 are connected with data line 16 to form a circuit In the shape 
of data flow graph 3 in hardware indicated in Fig. 2. Another alternation Is a construction in which the portions frequently 
appearing during an operation are formed Into a circuit in hardware inside a functional module 12 as shown in Fig. 4 

2* while other portions are operated with the floating point arithmetic element 10 according to instructions from the program 
memory 11. 

[0314] As described above, by automatically synchronizing different data, It becomes possible to make processing 
rapidly with little communication overhead. 

[0315] Next, explanation will be given on the thirteenth embodiment of the present Invention. 
30 [0316] As mentioned earlier, in the solution of Navier-Stokes equations by SIMPLE method, operations of cumula- 
tively adding the respective results of multiplication of first input by second input, third Input by fourth Input, fifth input 
by sixth input and seventh input by eighth input frequently appear, and increase of speed of this processing is one of 
the keys to shortening of the computation time. Fig . 5 is a flow graph schematically expressing the operation of Formula 
35. 

35 

x = axb + cxd + exf + gxh (35) 

[0317] Fig. 22 shows a hardware construction realized for integration of this processing. In the drawing, 85 are mul- 
40 tlpllers which execute multiplication of two data Input from the input terminal and output at the output terminal. 88, 87 
are adders which execute addition of two data input from the input terminal and output at the output terminal. The 
output terminal of the multiplier 85 and the input terminal of the adder 86 are connected to each other by means of 
data wire 88. Moreover, the output terminal of adder 86 and input terminal of adder 87 are connected to each other by 
means of data wire 89. By connecting adders and multipliers in the above construction, it becomes possible to execute 
40 the operations indicated in Fig. 5 and output the results at the output terminal of the adder 87 against the data input 
at the respective input terminals of the multipliers. 

[0318] In this way, element processors having so-called macro-functional blocks reaGzed by connecting computing 
elements by means of data wire greatly contribute to the increase of speed in the solution of differential equations such 
as Navler-Stokes equations, etc. 
60 [0310] Next, explanation will be given on the fourteenth embodiment of the present invention. 

[0320] Fig. 23 is a block diagram showing the construction of a communication circuit for realizing asynchronous 
and repetitive data communication between mutually adjacent element processors for the respective element proces- 
sors. 

[0321] In the drawing, 31 is an element processor, 5 is a processing unit for performing operations provided in the 
55 element processor 31, 8 is a data memory for storing the data necessary for the processing of the processing unit 
provided in the element processor 3 1, and 90 is a data bus for exchanging the information described in the data memory 
8 of the element processor 3 between mutually adjacent element processors 31. 

[0322] Next, explanation will be given on actions. If, In the processing by the processing unit 5, any information of 
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adjacent element processor 3i is required, that adjacent element processor 3i performs memory access actions of 
referring to the necessary Information described in the data memory 8 of the counterpart utterly asynchronously and 
repeatedly with the adjacent element processor 3i. The operation performed by the respective element processors 3i 
in the solution of partial differential equations Is local because it depends not on the Information held by remote element 
* processors 31 but only on the information held by the adjacent element processor 3i. Therefore, communication between 
mutually adjacent element processors 3i shall preferably be maintained utterly asynchronously and repeatedly through 
the data bus GO, without disturbing the processing of the processing unit 5. 

[0323] With a construction realized in a way to allow utterly asynchronous and repeated memory access between 
mutually adjacent element processors 3i without disturbing the processing of adjacent element processors 31, the 

10 communication between mutually adjacent element processors 3i ere held utterly asynchronously and repeatedly 
through the data bus 90, without disturbing the processing of the processing unit 5. This makes it possible to continue 
the operations performed on the respective element processors 3i in an utterly asynchronous way. 
[0324] While the processing unit 5 and the data memory 8 have data communication between the respective element 
processors 31 by using a data bus 90 capable of referring to the memory without disturbing the action of the counterpart 

15 In this embodiment, the data communication system between element processors becomes synchronous when there 
is any need of synchronous operation for interrupting the action of the processing unit 5 in the adjacent element proc- 
essor 31 to have access to the data memory 8 and this increases the communication overhead but does not put any 
obstacle to the obtention of analytical solution. 

[0325] Next, explanation will be given on element processor for obtaining an output easy for arranging and sorting 
20 of data even with asynchronous absorption of data, according to the fifteenth embodiment of the present Invention. 
[0328] Rg. 24 is a block diagram showing the construction of an element processor for obtaining an output easy for 
arranging and sorting of data even with asynchronous absorption of data, realized by attaching an Identifier for each 
quantity of state inside the memory. 

[0327] In the drawing, 3i is an element processor, 70 is a processing unit for performing operations provided in the 
25 element processor 3i, 8 is a data memory for storing the computation data including output data end provided in the 
element processor 3i, 90 is a data bus connecting element processors, 91 is an I/O unit for sending out data to outside 
the element processor 31 by attaching identifier depending on the kind of quantity of state to be output and provided 
In the element processor 31, and 92 Is an output data bus for transferring the data output through the I/O unit 91 to the 
host computer. 

30 [0328] Next, explanation will be given on actions. The results of computation obtained with a series of processings 
of the processing unit 70 are constructed with a plural number of kinds of quantity of state and yet the sampling time 
interval of the output data is very much shortened because high-speed operation Is intended. For that reason, the 
output data Is given, at its head or tail, an identifier which enables identification of the quantity of state. The output data 
Is continuously output to outside its own element processor 3i asynchronously and in a large volume in the form of 

39 packet communication in which a plural number of physical values are carried in a mixed way, and sent out to the host 
computer through the I/O unit 91 to be submitted to post-processing. At that time, because an output data bus 92 is 
provided for each of the element processors 3i, the data output through the I/O unit 91 is output asynchronously without 
disturbing the output of the adjacent element processors 3i, and transferred to the host computer. Moreover, because 
the output data Is provided with an identifier enabling judgement of that quantity of state, the host computer can perform 

40 post-processing by discriminating the data for arranging and sorting even if output data consisting of a plural number 
of physical values in the form of packet is sent in a mixed way. 

(0329] Therefore, according to this embodiment, intermediate results of trangent calculation can be output without 

disturbing the operation of the adjacent element processors 3i, thus achieving high-speed operation. 

[0330] The following output method of synchronous type is also possible as a matter of course: The host computer 

45 sends an output command to the respective element processors 3i through the output data bus 92, and the element 
processors 31 execute the reference function of the data memory provided inside the I/O unit 91 according to the 
command received, absorb the output data of the required quantity of state in the order described in the command and 
prepare a sequence of output data. The sequence of output data generated in the I/O unit are sent out through the 
data bus 92. Therefore, It becomes possible to make output at optional time by adjusting the timing of sending output 

50 commands from the host computer to the respective element processors 3i. 

[0331] Next, explanation will be given on the sixteenth embodiment of the present invention. 
[0332] Rg. 25 Is a construction drawing of the entire computer system for collecting asynchronously output data from 
the respective element processors and visualizing them in real time, according to the sixteenth embodiment of the 
present Invention. 

55 [0333] In the drawing, 3J Is an element processor performing operations, 93 is an I/O unit for output for converting 
data when outputting the results of operation to outside and provided in the element processor 3j, 94 is an output data 
bus for transferring the data output through the I/O unit 93 for output on the respective element processors, 95 is a 
data output control unit for converting the parallel data In at least equal number with that of the element processors 3J 
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transferred from the respective element processors 3j through the output date bus 94 into serial data, 96 is a serial 
data bus for transferring the data converted Into serial data through the data output control unit, 97 Is a post-processing 
unit for sorting the data sent through the serial data bus 96 by type and performing post-processings such as inversion 
of scale, double scaling, averaging, correction of colour tone, indication of contour, contour processing, fringe treatment, 
vector Indication, etc., 98 is an Image output data bus for transferring image data prepared by the post-processing unit 
97, and 99 is a display for indicating the image data transferred through the image output data bus 96. 
[0334] Next, explanation will be given on actions. 

[0336] The output data bus 94 is directly connected with the element processor 3j, and the external output data of 
the respective element processors 3j Is output asynchronously through the I/O unit for output 93. Trie output data is a 
sequence of data containing a plural number of physical values, at the point in time when it is output from the element 
processors 3J. Moreover, at the time of output, the data construction is made in the form of packet by attaching Identifier 
showing the element processor number to either the head or the tail of the data sequence to make clear from which 
element processor 3j the data is output. The data sequence is collected to the data output control unit 95 through the 
output data bus 94 and converted Into one serial data from parallel data in a number at least equal to that of the element 
processors. Moreover, the data sent to the post-processing unit 97 through the serial data bus 96 are assigned ac- 
cording to the identifiers of element processor number and also split and arranged according to the identifiers of quantity 
of state. The arranged data is submitted to post-processings for visualization such as contour processing, fringe treat- 
ment, vector indication, etc. and turned into visual images on the display 99 through the image output data bus 98. 
Therefore, according to this embodiment, intermediate results of trangent calculation can be output without disturbing 
the operation of the element processors 3), thus achieving high-speed operation. 

[0336] It is also possible, naturally, to output the data output asynchronously from the I/O unit for output through the 
post-processing unit 97 and freely control the time intervals of visualization by setting the sampling time Inside the 
post-processing unit 97. 

[0337] If the output data bus 94 is connected to the data output control unit 95 from the respective element processors 
3 as shown in Fig. 25, the wiring becomes a complicated one and this presents a risk of making miniaturization of 
equipment difficult or frequently producing manufacturing troubles. 

[0338] Explanation wilt be given on the seventeenth embodiment realized for the purpose of solving that problem 
based on drawings. Fig. 26 is a construction drawing of an entire computer system for solving the said problem by 
minimizing the output data bus 94. The output data bus 94 Is a single data bus connecting all the element processors 
3j and the data output control unit 95. Moreover, 100 is the host computer and 101 Is a network connecting the host 
computer and 100 and the output data bus 94. Others are the same as those In Fig. 26. 
[0339] Next, explanation will be given on the actions. 

[0340] The output data is a sequence of data containing a plural number of physical values. The host computer 
controls the timing of sending the output data from the respective element processors 3j to the output data bus 94. 
The respective element processors 3j send out output data to the output data bus through the I/O unit for output 93. 
At that time, the data construction is made in the form of packet by attaching identifier showing the element processor 
number to either the head or the tall of the data sequence to make clear from which element processor 3j the data Is 
output The data output control unit 95 rearranges the data coming in through the output data bus 94 according to the 
identifier of element processor number and convert them Into one serial data. Moreover, the data sent to the post- 
processing unit 97 through the serial data bus 98 are classified and arranged according to the identifiers of quantity of 
state. Trie arranged data is submitted to post-processings for visualization such as contour processing, fringe treatment, 
vector indication, eta and turned into visual images on the display 99 through the image output data bus 98. Therefore, 
with the construction of minimum output data bus, intermediate results of trangent calculation can be output without 
disturbing the operation of the element processors 3j, thus achieving high-speed operation. 
[0341] Next, explanation will be given on the eighteenth embodiment of the present invention. In conventional parallel 
computers, which are mainly Intended for high-speed computation, a problem was that increase of computing speed 
does not necessarily lead to increase of speed of the entire operations because, when either indicating the results of 
computation of stationary computation or indicating the results of computation at every moment of trangent calculation, 
It is necessary to suspend the operation of all element processors or keep other element processors on standby by 
suspending their operations until the results of all the element processors are available. 

[0342] Ffg. 27 is a construction drawing of an element processor indicating an example for increase of speed of 
entire computation by parallel indication of results of computation, and Fig. 28 Is a general construction drawing of a 
multi-processor composed of element processors indicated in Rg. 27. 

[0343] In the drawings, 3s Is an element processor, 70 is a processing unit for performing computation provided in 
the element processor 3s, 6 Is a memory unit for storing the computation Information necessary for the computation 
of the processing unit 70 and provided in the element processor 3s, 107 is an output information processing unit for 
receiving the computation Information stored in the memory unit 8 without suspending the computation of the processing 
unit 70 and keeping that information as it is If one computation lattice point Is assigned to one element processor but, 
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in case a plural number of lattice points are woven in one element processor, processing the information of those lattice 
points, 104 is a computation Information bus In One direction for exchanging data necessary for the operations between 
the memory units 6 inside the element processor 3s in the line direction, 105 is a computation information bus in 
direction of sequence for exchanging data necessary for the operations between the memory unit 8 Inside the element 

o processor 3s In the direction of sequence, 108 is an internal computation Information bus for exchanging data necessary 
for the operations between the processing unit 70 and the memory units 8 or data of results of computation, 108 is an 
output processing information bus for transmitting the computation information stored in the memory units 70 to the 
output Information processing unit 107, 110 are display elements such as elements for converting computation infor- 
mation into trarramisslvity of light like liquid crystal display element or using Kerr effect of changing magnetic strain into 

10 reflection of light, colour liquid crystal display element for conversion into change of colour, etc., 111 are lamps for 
emitting light from the back side of the display element 1 1 0 if they are of light transmission type and from the surface 
side of the display elements 1 1 0 in the case of light reflection type, 1 1 2 are reflectors for turning the light of the lamps 
111 into parallel light, and 113 is an output processing Information bus for transmitting output processing information 
from the output Information processing unit 107 of the respective element processors 3s to the display elements 110. 

15 [0344] The memory unit 8 of the respective element processors 3s stores the computation information necessary 
for the computation of the processing unit 70 and exchange data necessary for operations (input date) or data of results 
of computation (output data) between the processing unit 5 and the memory unit 6. Therefore, until the time when the 
computation converges or for displaying the change in the results of computation at every moment of trangent calcu- 
lation, there is no choice but disturb the entire computation by either stopping the computation of the processing unit 

2D 70 to have access to the memory unit 8 or by remaining on standby until the computation of all the element processors 
3s converges to display the results of computation after convergence. In this embodiment, an output Information 
processing unit 107 connected asynchronously with the memory unit 8 (can receive the computation information sent 
out by the memory unit 8 without matching with the memory unit nor disturbing the communication between the memory 
unit 8 and the element processors 3s) is provided inside the element processors 3s, so as to keep the computation 

25 Information sent out by the memory unit 8 as it is if one computation lattice point is assigned to one element processor 
but, In case a plural number of lattice points are woven in one element processor, process the information of those 
lattice points, and also convert digital data into analog data through a digital/analog conversion circuit held as internal 
construction to send it out to the display elements through the output processing Information bus 113. The display 
elements 110 convert computation Information into transmissMty of light like liquid crystal display element, change 

30 magnetic strain into transrnissivfty of light (element using Kerr effect), or convert into change of colour of transmitted 
light (colour liquid crystal display element), etc At that time, the display elements 110 can visually display the output 
processing Information without disturbing the action of the output information processing unit 107 because light is 
Irradiated from the back side if they are of light transmission type and from the surface side of the display elements 
110 In the case of light reflection type by the lamps 111 converted into parallel light by the reflectors 112. 

3* [0345] In this way, this construction makes it possible to visually observe the intermediate progress of computation 
or final results of computation and utilize them for the purpose of control without disturbing the high-speed processing 
of the computation. 

[0346] Next, explanation will be given on the nineteenth embodiment of the present invention. 
[0347] If the output processing Information bus 113 is wired individually to the display elements 110 from the nespec- 
4> tlve element processors 3s as shown In Fig. 28, the volume of wiring becomes enormous making it impossible to secure 
a targe distance between the computer system and the display unit Fig. 29 is a construction drawing of an entire 
computer system for solving the said problem by having a single serial bus between the computer system and the 
display unit 

[0348] In the drawing, 1 1s the host computer, 114 is a data output control unit for converting the output data sent at 
45 random Into serial data by changing their order, 113 is an output processing information bus connecting all the element 
processors 3s and the data output control unit 114, 117 Is a network connecting the host computer 1 and the output 
processing information bus 11 3, 116 is a display data control unit for decomposing the serial data and assigning them 
as display data to the respective display elements 110, 115 is a serial data bus connecting the data output control unit 
114 and the display data control unit 116, and 118 are display data buses connecting the display data control unit 116 
so and the respective display elements 110. Others are the same as those in Fig. 26. 

[0348] Next, explanation will be given on actions. The output information processing unit 107 keeps the computation 
Information sent out by the memory unit as it is if one computation lattice point Is assigned to one element processor 
but, In case a plural number of lattice points are woven in one element processor, processes the information of those 
lattice points. The host computer 1 controls the timing of sending output processing Information from the respective 
85 element processors 3s to the output processing Information bus 113. The respective element processors 3s send out 
output processing Information from the output information processing unit 107 to the output processing information 
bus 113 according to the timing controlled by the host computer 1. At that time, the data construction is made in the 
form of packet by attaching Identifier showing the element processor number to either the head or the tail of the data 
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sequence to make clear from which element processor 3s the date is output. The data output control unit 114 rearranges 
the output processing information coming in through the output processing Information bus 113 according to the iden- 
tifiers of element processor number and converts them into one serial data. Moreover, the data sent to the display data 
control unit 116 through the serial data bus 115 are assigned again to the corresponding display elements 110 respec- 

9 tlveiy according to the Identifiers of element processor number, converted from digital data into analog data through a 
digital/analog conversion circuit and sent out to the respective display elements through the display data bus 1 1 3. The 
display elements 110 convert computation information into transmissivity of light like liquid crystal display element, 
change magnetic strain into reflection of fight or convert It into change of colour of transmitted light, for example. At 
that time, the display elements 11 0 can visually display the output processing information without disturbing the action 

10 of the output Information processing unit 1 07 because light is i rradiated from the back side if they are of light transmission 
type and from the surface side of the display elements 11 0 in the case of light reflection type by the lamps 111 converted 
Into parallel Dght by the reflectors 112. 

[0350] As explained above, this construction makes It possible to secure a large distance between the computer 
system and the display unit, visually observe the intermediate progress of computation or final results of computation 
10 and utilize them for the purpose of control without disturbing the high-speed processing of the computation with the 
minimum data bus construction. 

[0351] Next, explanation will be given on the twentieth embodiment of the present invention. This embodiment is 
explained in fluid analysis, etc., for example. 

[0352] The accuracy of analysis Improves with the use of body fitted curvature using lattices along the surface of 

20 body when analyzing a flow along the external surface of a material having a complicated curved shape if the object 
of computation is an external flow but along the internal wall face having a curved shape if it is an internal flow, with 
the use of a complex curvature using a combination of coarse lattices and fine lattices in the case where the object of 
computation has a locally complicated structure, and with the use of solution fitted curvature which automatically re- 
duces the size of lattices of the part with violent changes in the case where the phenomena produce violent local 

25 changes. However, to use such coordinate systems, it is necessary to constantly perform troublesome conversion of 
coordinates between physical coordinate system and computed coordinate system, This embodiment is intended to 
not only facilitate generation of lattices but also increase the computation speed in the fluid analysis and shorten the 
display time of the results of computation, by providing a special coordinate conversion unit having the function of 
coordinate conversion between the physical coordinate space and computed coordinate space. Fig. 30 indicates the 

30 general construction drawing of the multi-processor. 

[0353] In the drawing, 1 is the host computer, 3 are element processors, 7 is a connection between element proc- 
essors connecting between the element processors for exchanging information, 18 are secondary computation boards 
realized by connecting element processors 3 In the shape of two-dimensional planes by means of the connection 
between element processors 7, 4 Is a network for exchanging information by connecting the element processors in 

35 the secondary computation boards 18, and 120 Is a special coordinate conversion unit for converting coordinates 
provided on the way of the network 4. 

[0354] When Initial conditions and boundary conditions are loaded on the multi-processor from the host computer, 
first the coordinate conversion unit 1 20 converts coordinates from physical coordinate system to computed coordinate 
system, and only the information relating to the computed coordinate is input in the multi-processor. Therefore, the 
40 troublesome computation of coordinate conversion Is separated in the element processors and the essential fluid com- 
putation is speeded up. Moreover, while it Is necessary to convert the results of computation for conversion into a 
physical coordinate system easy for visual understanding when out putting the results of computation, the conversion 
into physical coordinate system which prevents the operation inside the element processors is performed by the special 
coordinate conversion unit during transfer from the element processors to the host computer 1 . Therefore, not only the 
output time of the results of computation Is reduced but also theelement processors can maintain high-speed operations 
without disturbing the operations for the purpose of outputting the results of computation. 

[0355] As explained above, according to this embodiment, not only the conversion of coordinates necessary for the 
computation but also the formation of lattices become easier and the accuracy of the solution improves if the special 
coordinate conversion unit performing pre-treatment, post-treatment, etc. of coordinate systems Is supported with hard- 
& ware or software. Moreover, also In the visualization of the results of computation, reduction of display time can be 
achieved through high-speed conversion from computed coordinates to physical coordinates made by using the special 
coordinate conversion unit. 

[0355] Next, explanation will be given on the twenty-first embodiment of the present invention. 
[0357] Fig. 31 indicates computation lattice points in the neighborhood of the border of computation area showing 
35 an example for improving the efficiency of parallel computation made by levelling the load of the respective element 
processors. In the drawing, 125 are lattice points on the border of computation area, 126 are lattice points inside the 
computation area, and 127 are virtual lattice points adjacent to the lattice points outside the computation area on border 
126, all those lattice points corresponding at 1:1 to the respective element processors. 
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[0358] The lattice points on border 1 25, the number of adjacent lattice points of which is smaller than the number of 
the lattice points inside the computation area 126, are smaller in both amount of communication and amount of oper- 
ations than the internal lattice points 126. For that reason, virtual lattice points 127 are provided outside the border for 
equalizing the amount of processing with that of the Internal lattice points 126 by making dummy communication wtth 
o the lattice on border 126 In order to level the amount of processing of all lattice points, by avoiding overhead due to 
synchronization in the case where synchronized processing with other Internal lattice points 126 which reduces the 
processing time for the lattice points on border 1 25 is necessary, but avoiding uneveness of operating load in the case 
where asynchronous computation is performed. 

[0359] While, In the above-mentioned embodiment, explanation was given on a case where the computed lattice 
10 points correspond at 1:1 to the element processors, the same effects as those of the above-mentioned embodiment 
can be obtained , also In the case where a plural number of computed lattice points correspond to one element proc- 
essor, by connecting virtual lattice points 127 to the lattice points on border 125 and giving them the same computation 
load as that tor element processorcontainlng internal lattice points 126 only, because the processing time of the element 
processor containing lattice points on border 126 Is reduced. 
13 [0360] Therefore, according to this embodiment, it becomes possible to level the load of the respective element 
processors end improve the efficiency in parallel operation. 

[0361] Next, explanation will be given on the twenty-second embodiment of the present invention. I n the case where 
the element processor is a so-called Neumann's processor, they make time sharing processing when computation of 
a plural number of computed lattice points is assigned to it, because It is impossible to make parallel operation with a 

20 single Neumann's processor. 

[0362] On the contrary, computation of a plural number of computed lattice points can be executed in parallel with 
the use of a data flow processor, for example, capable of parallel processing on command level. Data flow processor 
Is Introduced by MITSUBISHI DENKI GIHOU, Vo. 66, No. 2, 1992, p.24 to p.25, for example. As it can be seen from 
such description, a data flow processor is capable of parallel processing on the command level. 

20 [0363] As explained above, simultaneously executable commands can be executed independently In a data flow 
(data driven) system. 

[0364] Explanation will be given on the method for realizing a differential equation soMng program based on finite 
difference method on a data flow processor having such characteristics. In the case of application to the element 
processor Indicated in Fig. 21 , for example, a computation program Is assigned to the program memory 11 and the 
30 floating point arithmetic element performs operations according to the data flow graph. 

[0365] Fig. 32 la a drawing showing a program memory in which part of the program tor solving a partial differential 
equation Is written. 

[0366] As shown in Fig. 32, the computation program for lattice points No. 1 and No. 2 is assigned to different areas 
131 , 132 of the program memory 11 . The initial packet input from outside is given in advance command code, destination 

35 node address, etc to have the command executed without passing through the program memory 11 . Execution packets 
are produced when queuing is completed in the matching memory 63. In the data memory 8, access to memory is 
made according to the command code. Of the field constituting the packets output from the data memory, the destination 
node address Is processed by the program memory 11 while the command code and the operand data are treated by 
the operation processing unit 10 respectively. Namely, the program memory 11 renews the destination node address 

40 of the Input packet Into the next destination node address read out and the operation processing unit 10 sets the 
results of operation In the first operand data field. The result packet is either processed again or output to outside as 
final result. The two programs stored In different areas 131 , 132 of the program memory 11 give command code and 
destination node address respectively to the result packet to enable simultaneous operations. On a processor of data 
flow system, those two programs exist in mixed state on the command level and are submitted to parallel processing. 
[0367] Next, explanation will be given on the twenty-third embodiment of the present invention. 
[0366] By performing processing using Identifiers on a dynamic processor of data flow system, It becomes possible 
to execute one same program on a plural number of sets of Input data. This type of dynamic processor of data flow 
system Is indicated in J A. Sharp: "Data Flow Computing 0 , Science Publishing Co. (1986), for example. 
[0369] As shown In Rg. 33, two lattice points (No. 1, No. 2) are assigned to the element processor concerned, and 

60 the program memory 130 stores small main programs 131, 132 for lattice points No. 1, No. 2, respectively. Trie function 
of the m8ln programs 131, 132 is to give those data to the computation kernel unit 133 after putting specific identifier 
numbers (colour Identifiers) to the respective lattice points. In the computation kernel unit 133, which simultaneously 
performs the processings 131 and 132 for lattice points No. 1, No. 2, the two processings are executed completely 
independently of each other because the data are provided with identifiers. 

53 [0370] As explained above, In a data flow processing not requiring any memory access, one same program can be 
executed simultaneously on a plural number of sets of data by Introduction of colour identifiers. However, no exclusive 
control by colour identifiers can be made for data memory access. This problem must be avoided on the programmer's 
responsibility. As a method of avoiding this problem, It is possible to obtain different data memory addresses tor re- 
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spedive colour identifiers by making conversion of colour identifiers and data memory address with the program. 
[0371] Fig. 34 indicates an example of function for storing the content of colour Identifiers (hereinafter referred to as 
"colour request function") in the operand data field. Fig. 34 is a data flow graph showing colour request functions. On 
the right side of the chart, the changes of data packet are also described. Since the drawing is a simple one, only the 
8 colour Identifiers, the left data (first operand) and the right data (second operand) are described as a format of packet. 
In the drawing, 135 to 138 ere nodes for processing data in order. The node 135 executes PC command, the node 
136 OR command, the Node 137 PC command and the last node 138 SWAP command. The function of the commands 
used for the How graph In Fig. 34 will be explained hereafter. 

10 (1) PC is a command for exchanging the colour identifier and the left data for each other. 

(2) OR Is a command for performing logical sum operation on bit-to-bit basis between the left data and the right 
data and storing the result of that operation in the left data. 

(3) SWAP Is a command for outputtlng the left data and the right data by exchanging them for each other. 

18 [0372] Next, explanation will be given on the action of the colour request function. The data packet input shall be 
considered as storing C1 as colour identifier, in the left data and DR1 In the right data. 

[0373] The input data is submitted, in the node 135, to an exchange of colour identifier for left data by the PC com- 
mand. Next, that data is submitted, in the node 136, to OR operation with the constant value "0" and, as a result, the 
data packet of 5 for colour identifier and C1 for left data and right data is output. Moreover, in the node 137, the PC 

20 command is executed to this data packet to turn it Into a packet of C1 for colour identifier, 5 for left data and C1 for 
right data. Lastly, en output packet on the node 138 exchange the left data and the right data for each other according 
to the SWAP command, have colour Identifiers Identical to the Input data packet and stores and outputs the colour 
Identifiers owned by itself in the left data. The output packet from the colour request function could acquire the colour 
Identifier owned by itself as left data. 

20 [0374] Next, explanation will be given on the method for calculating the address of data memory by using the results 
computed with the above-mentioned colour request fonction. 

[0375] Rg. 35 indicates a conceptual drawing of an example of method for assigning colour identifiers to the respec- 
tive lattice points. In the drawing, 139 are lattice points. Explanation will be given on two-dimensional analysis in which 
the lattice points 139 are arranged in the form of mesh. Here, a colour Identifier 0 is assigned at the left top of the lattice 

30 points 139 arranged In the form of mesh in lines N and columns N, and colours are assigned to the respective lattice 
points In such a way that the colour number increases as it progresses In the right direction and the downward direction. 
[0375] In the conceptual drawing indicated in Fig. 35, the symbols at left top of the respective lattice points 139 are 
colour identifiers assigned to the respective lattice points. Because the assignment was made as indicated above, he 
colour identifiers (CN) assigned to the neighboring lattioe points can be calculated as in Formula 36 below by using 

3* the colour identifiers assigned to own lattice points acquired with the colour request fonction. 

[0377] As terms indicating directions, the top will be called north, the bottom will be south, the right east and the left 
west. The equation © In Formula 36 Indicates the colour Identifier assigned to the neighboring lattice point In the 
direction of north of the lattice point concerned. In the same way, the equations© to @ in Formula 36 Indicate the 
colour identifiers assigned to the neighboring lattice point in the south, west and east directions respectively 

40 

©CN = Output value of colour request function - N 
@CN = Output value of colour request fonction + N 

45 ®CN = Output value of colour request fonction - 1 

@CN = Output value of colour request function * 1 (36) 



[0378] As shown In Rg. 36, Y words of the data memory 8 are assigned to the respective lattice points and the 
memory allocation is made in such a way that the address of the data memory assigned to the lattice point the colour 
Identifier of which Is Z may become the area of (Z X Y) to (X x Y + Y - 1). 

[0379] Therefore, the address of the memory assigned to the lattice point the colour identifier of which Is CN is given 
by Formula 37. 

CNA = CNX Y + k-1 (37) 
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[0380] However, k is a predetermined number end means that access is made to the "k"tri word of the data memory 
elements of Y words assigned to the lattice point the colour Identifier of which is CN. 

[0381] As explained above, because the system is constructed in such a way as to perform the conversion of colour 
Identifiers and data memory addresses with a program and have access to different data memory areas for respective 
a colour Identifiers, It is possible to realize asynchronous communication between lattice points at equal values by mu- 
tually referring to the memory of the element processors 3 also in the data flow system, without using any special 
hardware for it. 

[0382] Next, explanation will be given on the twenty-fourth embodiment of the present invention. 

[0383] This type of information processing system for solving differential equations divides the object equation into 

10 differences namely divides the 6pace (time is also Included in some case) in the lattice and, to determine various 
quantities at the respective lattice points (representative points of the divided space), executes convergence compu- 
tation by using the results of computation at neighboring lattice points. In this embodiment, memories for storing the 
results of computation at respective lattice points are assigned to the respective lattice points to realize exchange of 
data between lattice points by referring to those memories. Explanation will be given on an example showing the 

18 method of exchange of data between lattice points hereafter. 

[0384] Fig. 37 Indicates the data memory area assigned for storing the results of computation at one lattice point, in 
the drawing, 140 is an areas assigned to one lattice point In the data memory 6 for storing the results of computation 
at the lattice point, and 141 to 144 are areas laid out in the area 140 and storing the data of flag of necessity or not of 
communication for lattice points in the respective directions of east, west, south and north. 

20 [0385] As shown In Fig. 37, In the area assigned to each lattice point, two kinds of data i.e. (1) data to be referred 
to by the adjacent lattice point of the results of computation at the lattice point, and (2) flag indicating the necessity or 
not of communication in the respective directions of communication (here, the directions are named as directions of 
east, west, south and north on the supposition of two-dimensional construction) and If the communication is processing 
of lattice point laid out in one S8me processor or if it is a communication made through a network, etc. are stored. 
[0386] Fig. 38 is a flow chart showing the flow of processing In the case where communication is held between lattice 
points. First, when making communication between lattice points, the data stored in the predetermined address of the 
data memory area assigned to each direction of communication is read out (step ST1). Next, necessity or not of com- 
munication te judged In step ST2. Namely, (the processing unit) performs a processing for communication If the data 
read out Is "1" because It means that the adjacent lattice point exists in the direction in which the communication is 

30 Intended. On the other hand, if the data read out Is "0°, no processing for communication will be made because it 
means that this lattice point exists on the border of the system and no communication Is required in this direction. 
[0387] Next, In step ST3, the processing unit reads out the data stored in another predetermined address of the data 
memory in which a flag indicating the type of communication Is stored, and tests the data value read out in the same 
way as above. 

3* [0388] In the case where the data read out is "0", It means that communication is made with the lattice point laid out 
In one same processor and (the processing unit) performs address computation by the method indicated in the twenty- 
third embodiment of the present Invention and makes communication between lattice points by referring to and renewing 
the data memory area assigned to the adjacent lattice point. 

[0389] On the other hand, if the data read out is oiherthan "O", it means that the lattice point for making communication 
to is assigned to other processor and, at the same time, the processor No. to which the lattice point of the partner of 
communication is stored in the upper 8 bits of the 32-bit data read out, while the base address of the data memory 
assigned to the lattice point of the partner of communication is stored In the lower 24 bits. As shown in the twenty-third 
embodiment, an off-set address is added to said 32-bit data to compute the address of the data memory storing the 
desired data, if any distributed shared memory access is executed with said data as address, the data packet is trans- 
45 ferred through the network to the processor to which the lattice point for making communication is assigned, to read 
the dealred data, return to the transmitting processor again together with that data and continue the computation of 
the lattice point 

[0390] As explained above, in the multi-processor making communication between lattice points, a field for storing 
the information Indicating possibility or not or method of communication such as direction available for communication 
so or not for each direction of the respective lattice points, communication passing through a network or not, etc. was 
assigned In the data memory, and distributed shared memory access was used in the case of communication passing 
through a network. The mechanism of distributed shared memory access is disclosed In detail In the Japanese Patent 
Laid-open Gazette No. 3-122338. 

[0391] For that reason, both the processing of lattice points on the border and the processing of lattice points inside 
as the border can be realized with one same program and, therefore, the program size can be reduced. 

[0392] Moreover, when the respective lattice points are assigned to a plural number of processors, communication 
between lattice points ranging over the processors can be realized easily without relying on the method of assignment 
of lattice points. 
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[0393] In the twenty-fifth embodiment, twenty-sixth embodiment, twenty-seventh embodiment end twenty-eighth em- 
bodiment of the present Invention to be described hereafter, it is premised that the processing program of a plural 
number of lattice points Is assigned to one same element processor so as to perform exchange of data between the 
lattice points through the data memory In one same element processor. The respective embodiments will be explained 
0 in detail hereafter. 

[0394] Next, explanation will be given on the twenty-fifth embodiment of the present invention. Explanation will be 
given on the computation system having a table which indicates, in a mufti-processor which assigns a memory to each 
lattice point and makes communication between lattice points close to each other through this memory, the memory 
area In each direction for making communication for each lattice point 
10 [0395] Fig. 39 is a drawing showing division of data memory areas. In the drawing, 145 is an area assigned to one 
lattice point inside the data memory to obtain the results of computation at the adjacent lattice point, and 146 to 149 
are areas assigned inside the area 145 end storing the forefront address of the memory area of the lattice point in the 
east, west, south and north directions respectively. 

[0396] As shown In Rg . 39, the forefront address of the memory area of the lattice point in the respective directions 
is is stored in the assigned memory area for each lattice point In the case of this embodiment, the forefront address of 
the memory area of the lattice point of the direction of communication is stored by using 4 data areas 146 to 149 in 
the east, west, south and north directions for each lattice point 

[0397] Fig. 40 is a flow chart indicting the flow of processing in the case of communication. When making commu- 
nication, the forefront address of the memory area of the partner of communication is read out according to the direction 

20 of communication (step ST6). In the case where the desired direction of communication is a direction in which com- 
munication Is impossible, -1 is stored in advance in the area storing the forefront address of the memory area of the 
lattice point of the direction of communication. If the data read Is -1, no communication is made (step ST7). If the 
desired direction is Judged as a direction unavailable for communication, the address off-set of the data to be commu- 
nicated Is added to the forefront address of the partner of communication to obtain the address with which to make 

25 communication actually and perform actual communication (step STB). 

[0396] While an example of a case where the forefront address of the memory area in each direction for making 
communication is stored was indicated in the above embodiment, the present Invention can also be applied to an 
embodiment storing lattice point numbers in the case where memory areas are assigned In the order of lattice point 
numbers, for example. 

30 [0399] Moreover, while an example of a case where the forefront address of the memory area in each direction for 
making communication Is stored was indicated in the above embodiment, the present invention can also be applied 
to embodiments storing identifiers which can specify the memory area to be used for communication such as difference 
between the forefront add ress of the memory area of the lattice point of the transmitting party end the fbrefro nt address 
of the memory area of the partner of communication, distance between lattice points which is a value obtained by 

36 dividing said value by the size of the memory used by one lattice point, and a pointer, for example. 

[0400] Furthermore, while explanation was given on the case where there are 4 directions of communication in the 
above embodiment, the present Invention can also be applied to cases where there are 6 or 8 directions (including 
northeast northwest, southwest, southeast or part of such directions). 

[0401] Still more, while explanation was given on the case where -1 is stored in the direction unavailable for com- 
40 muni cation In the above embodiment, the present invention can also be applied to a case where any unused optional 
value is used as identifier indicating the direction unavailable for communication. 

[0402] Even In the case of a processing in which the lattice point to be used for communication changes in such a 
way that the adjacent lattice point to be communicated with is a little apart during a coarse node processing but comes 
closer during a fine node processing, those processings can be executed with one same program by using rewriting 

45 the area of the Identifier which indicates the memory area of the lattice point in the direction of communication. 

[0403] Moreover, in the past in a case where an area with projections and cuts as shown In Rg. 41 Is taken as an 
object of processing, the object of processing was selected as either a square or a rectangle formed in a way to include 
the area with projections and cuts. For that reason, ft was necessary to make the processing even for lattice points 
which essentially need not be processed. With the application of the present invention, however, the processing can 

so be made directly for an area with projections and cuts, because any lattice points In any optional direction can be set 
as lattice points unavailable for communication. This makes it possible to eliminate the memory area of the lattice 
points outside the area wfth projections and cuts and thus reduce the memory size. It also makes processing of the 
lattice points outside the area with projections and cuts unnecessary, enabling faster processing. This realizes both 
reduction of memory size and increase of processing speed. 

M [0404] For the Identifier used as identifier of direction of communication In this embodiment, using the value obtained 
by dividing the difference between the forefront address of the memory area of lattice point number or lattice point of 
transmitting party and the forefront address of the memory area of the partner of communication by the size of the 
memory used by one lattice point Is more effective, compared with a case of using either address or difference of 
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address itself, for reducing the number of bits required as memory and thus reducing the memory size used. 
[0405] While the above embodiment provides a method of access to memory area in the case where a large number 
of lattice points are woven in a single element processor, the communication can be simplified with the embodiment 
Indicated hereafter as far as the communication between element processors is concerned. 

* [0406] When element processors are connected In a lattice type topology, a problem Is the communication wtth 
element processors in diagonal directions. Fig. 42 is a drawing showing a communication system with element proc- 
essors in diagonal directions. In the drawing, 151 indicates an element processor in the NWdirection, 152 is an element 
processor In the N direction, 153 is an element processor in the NE direction, 164 is an element processor in the W 
direction, 165 is an element processor under computation, 156 is an element processor in the E direction, 157 is an 

10 element processor in the SW direction, 158 is an element processor in the S direction, and 159 is an element processor 
In the SE direction respectively. 

[0407] Moreover, in the drawing, 160 Indicates a virtual communication route in the NW direction, 161 is a virtual 
communication route in the SE direction, 162 is a network between element processors, 163 Is a virtual communication 
route In the SW direction, and 164 is a virtual communication route In the NE direction. In the case where the element 

is processor 155 has communication with the element processors 151 , 159 in the NW and SE directions, the conventional 
method consisted in either installing virtual communication routes 160 and 1 61 directly between the element processors 
161 and 155 and between the element processors 1 55 and 158 to make communication or making data communication 
by way of the element processor 152 or 158. However, such methods had a lot of problems such as complicated 
communication network with the former method and deterioration of speed performance of the entire system due to 

20 large waiting time for synchronization for communication, etc. If the respective element processors have the data held 
by the element processors in higher and lower positions in duplicate, the communication system in the communication 
with the element processor 151, for example, can be simplified by referring to the data of element processor 161 on 
the memory of the element processor 164. However, the data held in duplicate must be rewritten every time when the 
original data is renewed. 

2* [0408] Furthermore, the same effects can be obtained also by having in duplicate the data of the element processors 
In the left and right sides instead of the element processors in higher and lower positions and having access, when 
making communication with the element processor 151 for example, to the data of element processor 151 stored on 
the memory of the element processor 152. 

[0409] Next, explanation will be given on the twenty-sixth embodiment of the present Invention. 
30 [0410] A multi-processor which assigns a memory to each lattice point and makes communication between lattice 
points dose to each other through this memory retains Information indicating the memory area In each direction for 
making communication for each lattice point and especially has information Indicating a common dummy area in the 
direction unavailable for communication. 

[041 1] Fig. 43 Indicates a layout drawing of memories used in the twenty-sixth embodiment. 

35 [0412] In Fig. 43, 170 Is a memory area assigned to the lattice points forming the subject of processing inside the 
data memory 8, 171 is a memory area assigned to one lattice point Inside the memory area 170, and 172 is a memory 
area for one lattice point assigned inside the data memory 8 as dummy area in the direction unavailable for commu- 
nication separately from the memory area 170 assigned to the lattice points forming the subject of processing. A memory 
area for one lattice is secured as dummy area in the direction unavailable for communication in addition to the memory 

40 area for the lattice points forming the subject of the actual processing . An Identifier Indicating the dummy area is stored 
as Identifier indicating the direction unavailable for communication in the area in which to store the memory area for 
the lattice points in the respective directions of communication assigned to the lattice points forming the subject of the 
actual processing. 

[0413] Fig. 44 indicates the flow of processing of this embodiment For establishing communication, the identifier 
Indicating the memory area of the lattice point In the direction of communication is read (step ST10). After that, the 
communication processing Is made to that area regardless rf that identifier Is a dummy area or not (step ST 11). 
[0414] When any readout from the area of the partner of communication for the communication processing, a certain 
value not having any bad influence on the processing Is stored in advance also in the dummy area. However, in a 
communication, It Is possible to restrict the processing from transmitter to partner of communication to writing only by 
go processing the data necessary for the processing of respective lattice points in a way to be delivered in advance from 
adjacent lattice points without fail. 

[0415] Because only one kind of processing is made regardless If communication Is possible or not, no program is 
required for Judgement and the program size can be reduced. 

[041 q Because only one kind of processing is made regardless if communication is possible or not, it is possible to 
60 control deviations in the computation load for the respective lattice points and avoid any excessive progress of the 
processing only for lattice points with a lot of directions unavailable for communication, and thus realize contraction of 
program size and increase of processing speed even in a case where synchronized processing among respective 
lattice points, etc. Is necessary. 
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[0417] Next, explanation will be given on the twenty-seventh embodiment of the present invention. 
[0418] lb shorten the delay time In the exchange of data, when mutually adjacent lattice points are assigned over 
two element processors , it is effective for the element processors on both sides to mutually have the data of the partner 
in duplicate. In that case, when the data for those lattice points has been renewed, it la necessary to send new data 
8 to the adjacent program without fail to guarantee the consistency of the data of the element programs on both sides. 
[041 9] However, sending data to the adjacent element processors each time when the data is renewed unnecessarily 
increases the volume of communication between processors and have negative influences on the essential processing 
when the difference of the renewed data is very small. 

[0420] Fig. 45 is a conceptual drawing showing the situation of communication between mutually adjacent proces- 

10 sors. In the drawing, the element processor 3 on the left side performs the computation of lattice points No. 0 to No. 
n while the element processor 3 on the right side performs the computation of lattice points No. n+1 to No. m. As shown 
in Fig. 45, when the element processor 3 on the left side sends the data of results of computation of lattice point No. 
n to the adjacent element processor 3 for example, such bad Influence can be avoided by comparing the data A to be 
transmitted to the adjacent element processors with the data B of the previous generation and stop the sending If they 

16 are identical or have a difference no larger than a certain value. 

[0421] While the data B is assumed to be that of the previous generation (namely, data of results of computation of 
previous time) here, this data B may also be a data transmitted previously to the adjacent element processor 3 because 
there is a fear that differences may accumulate even If the quantity of change produced each time is very small. 
[0422] Next, explanation will be made on the simplification of the program for solving equations according to the 

20 twenty-eighth embodiment of the present invention. 

[042^] Fig. 46 is a plan view showing the relation between the computing area for analyzing equations and the lattice 
points assigned to the computing area. Lattice points of one external circle are considered on the outside of the lattice 
points requiring processing surrounded by solid line. A memory area is assigned for the respective lattice points of one 
external circle, and numbers are given under the respective lattice points as lattice point numbers. 

25 [0424] When the lattice point number of the transmitter is n, the lattice point numbers in the respective directions 
can be obtained with the computation of Formula 38 and there is no need of any special processing even for the lattice 
points at the edge. 

30 Lattice point number (East direction) = n + 1 

Lattice point number (West direction) = n - 1 
Lattice point number (South direction) = n - 10 

Lattice point number (North direction) = n + 10 (38) 

[0425] Supposing that the data for lattice points is secured for each lattice point by a quantity of R entry in the order 
of lattice points from the address S, the forefront address In the data area of the lattice points of the partner of com- 
40 munlcation can be obtained with Formula 39, if the lattice point of the partner of communication obtained with the above 
computation Is put as D. The communication processing is the same as that in the twenty-fifth and the twenty-sixth 
embodiments. 

45 DXR + S (39) 

[0420] In the case where the communication processing includes readout from the area of the partner of communi- 
cation, a value not having any bad influence on the processing is stored in advance In the respective dummy areas. 
This can be realized by such means as eliminating any large Influence by providing the dummy area with the same 

so data as that of the adjacent lattice points which get the wall data. Moreover, if there is any value serving as a unit 
element for the respective operations, the above protection can also be realized by setting that value. 
[0427] While the lattice point numbers in the respective directions were obtained by calculation in this embodiment, 
this method can also be applied to any embodiment in which identifiers of lattice points in the direction of communication 
are stored in the memory in the same way as the twenty-fifth embodiment. 

ss [0428] While explanation was given on a case where memories of blocked areas are assigned to the respective 
lattice points In the twenty-fifth to twenty-seventh embodiments, this method can also be applied to any embodiment 
In which areas are secured In blocks for respective elements, i.e. case where areas are secured In blocks for respective 
elements such as pressure value, flow velocity, etc and that the respective areas store the respective elements in the 
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order of lattice point numbers. 

[0429] In a computation system capable of obtaining data of adjacent lattice points by assigning a data memory area 
for each lattice point and referring to said memory, it is possible to realize simplification of processing program, com- 
pression of program size and increase of processing speed by assigning, to the lattice points located at the border, 
8 virtual data areas simulating the external lattice points. 

[0430] When securing areas by arranging them in blocks for respective elements, it becomes possible to improve 
the hitting ratio of cash end increase the processing speed by arranging in blocks the elements to which to have frequent 
access. 

[0431] Next, explanation will be given on the twenty-ninth embodiment of the present invention. 
10 [0432] in solving a differential equation, the object system is divided into small lattice points. In that case, however, 
there is a fear of extended time for reaching convergence, because a time proportional to the number of points from 
the border to the respective lattice point nodes is required for the state of lattice points determined by the boundary 
conditions to spread to all the lattice points and stabilize. 

[0433] To avoid such Inefficiency, there Is a way to improve the speed of convergence by alternate use of coarse 
18 node and fine node. Fig. 47 Indicates computed lattice points in the neighborhood of the border of computation area 
showing an example for Improving the speed of convergence by alternately using coarse nodes end fine nodes. In the 
drawing, 125 are lattice points on the border of the computation area, 126a are lattice points Inside the computation 
area at the time of connection by fine nodes, 1 26b are lattice points Inside the computation area at the time of connection 
by coarse nodes, 7a is a connection between element compressors for fine node connection and 7b is a connection 
20 between element compressors for coarse node connection, and those lattice points and the respective element proc- 
essors correspond to each other at 1:1. 

[0434] In this embodiment, all lattice points inside the computation area 126a and 126b are connected by means of 
connection 7a between element compressors for fine node connection at the time of fine node connection, and the 
internal lattice points 126b for coarse node connection are connected by Jumping a plural number of internal lattice 

25 points by means of connection 7b between element compressors for coarse node connection at the time of coarse 
node connection. Here, because the internal lattice points 126b for coarse node connection play the role of computed 
lattice points in both fine node operation and coarse node operation at the time of coarse node connection, the data 
memory inside the lattice points 126b plays the role of exchanging information between coarse node operation and 
fine node operation. And, In the computation, fine node operation and coarse node operation are used alternately. In 

30 such construction, the boundary conditions of the lattice points 125 on the board are transmitted to the Inside of the 
computation area 3 times faster than during the fine node operation by means of the connection 7b between element 
compressors for coarse node connection, and then transmitted to the internal lattice points 126a for fine node connec- 
tion in the neighborhood by fine node connection to perform detailed computation. 

[0435] Therefore, according to this embodiment, it is possible to increase the speed of convergence by making the 
38 boundary conditions penetrate into the computation area immediately through the use of coarse nodes and then obtain 
results of operation of high computation accuracy with the use of fine nodes, to eventually achieve high-accuracy high- 
speed operation. 

[0436] While an example In which the lattice points of fine node connection are connected at every 2 points at the 
time of coarse node connection was given In this embodiment, similar effects can be expected also with connection 

40 made by skipping whatever number of points as a matter of course. Moreover, while the coarseness of lattices was 
given In 2 grades or fine node connection and coarse node connection in this embodiment, the same effects can be 
obtained also with the use of lattices in a larger number of grades of coarseness. Furthermore, while in this embodiment 
explanation was given on a case where the computation at coarse node connection and the computation at fine node 
connection are made alternately one by one, It is also all right to make the compulation at coarse node connection and 

48 the computation at fine node connection by alternating a plural number of times of one and the other. 

[0437] While In the above example explanation was given on a case where the computation at coarse node connec- 
tion and the computation at fine node connection are used alternately, H is also possible to adopt a method of gradually 
improving the accuracy of the solution after reaching convergence with coarse nodes. 

[0438] Namely, this method consists, In Fig. 47, in performing computation repeatedly until a convergence solution 
80 is obtained by using the internal lattices points 126b and the lattice points on border 1 25 at the time of coarse node 
connection connected with the connection 7b between element processors at coarse node connection and, after that, 
either giving the solution of the Internal lattices points 126b at the time of coarse node connection as Initial conditions 
or giving the solution of the Internal lattices points 126b at the time of coarse node connection as initial conditions by 
proportionally distributing it according to the space distribution of the fine nodes and then repeatedly performing the 
88 computation by using the Internal lattices points 126a connected with the connection 7a between element processors 
at fine node connection, the internal lattices points 126b at the time of coarse node connection and the lattice points 
on border 125 to take the convergence solution as final solution. 

[0439] By adopting this method, It becomes possible to sharply accelerate the computation for reaching a rough 
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convergence. Namely, this makes it possible to not only avoid uselessly making local calculations but also reach a 
convergence stably and reach the final convergence quickly. 

[0440] While in this embodiment explanation was given on a case where the lattice points and the respective element 
processors correspond to each other at 1:1, the same effects can be obtained also In a case where a space area in 

* which a plural number of lattice point groups exist are made to correspond (assigned) to each element processor. 
[0441] Next, explanation will be given on the thirtieth embodiment of the present invention. 
[0442] In the above twenty-ninth embodiment, explanation was given on a case where coarse node connection is 
realized by skipping the connection between mutually adjacent element processors with jumper wiring. 
[0443] I n this embodiment, explanation will be given on a case where those coarse nodes are realized with S/W only. 

io [0444] In a case where a data memory area is secured for each lattice point and the data communication between 
lattice points Is realized through mutual access, by lattice points requiring communication, to this data memory area 
secured for each lattice point, as explained in the twenty-fifth embodiment of the present Invention, it is possible to 
perform data communication with lattice points other than adjacent lattice points i.e. realize coarse node connection 
with S/W only by having a table indicating the data memory area in the respective directions of communication for each 

" lattice point and storing, in each entry of this table, the address of the data memory area secured for the lattice points 
skipped by the desired number of lattice points. 

[0445] Therefore, It is possible to perform fine node processing by setting the table indicating the data memory area 
In the respective directions for making communication in a way to realize coarse node connection at the starting stage 
of solving of equations and then changing that table, after convergence of the processing, in a way to indicate the data 
20 memory area secured for the adjacent lattice points so as to realize fine node processing. 

[0446] Moreover, by rewriting the table as required by the same method as above, it is also possible to alternately 
execute coarse node processing and fine node processing. 

[0447] Furthermore, by having 2 pieces of said table indicating the data memory area In the respective directions 
for making communication i.e. one for coarse node processing and another for fine node processing and by executing 
20 the program for coarse node processing and the program for fine node processing in parallel, it is also possible to 
execute coarse node processing and fine node processing in parallel. 

[0446] Even in the case of processing in which the lattice point for making communication changes in such a way 
that the adjacent lattice point for making communication Is a little apart during a coarse node processing but Is close 
at hand during a fine node processing, etc., it is possible to perform the processing with one exactly same program by 
30 just rewriting the area of the identifier Indicating the data memory area of the lattice point in the direction of communi- 
cation. 

[0449] While in the above embodiment explanation was given on a case where the convergence speed is Improved 
with simultaneous execution of coarse node and fine node, there is no need of having one same algorithm for both 
coarse node processing and fine node processing. By having separate programs for coarse node and fine node, it 
3* becomes possible to use different algorithms for coarse node processing and fine node processing. 

[0450] For example, use of an algorithm of high convergence speed but of inferior accuracy in coarse node further 
improves the convergence speed of the system as a whole. 

[0451] In the above embodiment relating to SIMPLE method, the current data of adjacent lattice points Is referred 
to for performing computation regarding the lattice points concerned. However, by having no less than 2 generations 

40 of data regarding adjacent lattice points and omitting the computation of nodes concerned when the fluctuation between 
generations Is no more than a prescribed value, it becomes possible to reduce the total amount of computation and 
sharply shorten the computation time. Moreover, omission of computation of lattice points with a fluctuation between 
generations no larger than a prescribed value also produces an effect of enabling use of obtained calculated resource 
for the processing of other lattice points. "Data of 2 generations* mentioned here refers to the "data based on the results 

45 of computation in the previous step of the repeated computation" and the "data based on the results of computation 
in the step before last of the repeated computation" regarding adjacent lattice points. 

[0452] In the SIMPLE method mentioned above, conditions for omitting the computation at each lattice point are 
given. The difference of data between generations of adjacent lattice points was of a fixed value. However, by setting 
the tolerance of the difference of data between generations which Is the condition for omitting computation, at a large 
60 value but gradually contracting It in the course of the convergence computation, it is possible to sharply accelerate the 
computation for reaching a rough convergence. Application of this method makes it possible to avoid making any 
useless local calculations and reach the final convergence quickly. 

[0453] Next, explanation will be given on the thirty-first embodiment of the present invention. 
[0454] The conventional parallel computers, which were mainly intended to minimize the number of time-iterations, 
03 had a defect that the convergence processing determines the speed of the entire computation because the computer 
adopted the consecutive means of judging convergence or not at the point in time when the computation of all lattice 
points at certain time step is over and, if convergence is not reached yet, starting the computation of the respective 
lattice points again by time marching. 
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[0455] Because the relation of (computation cost at each lattice point CA) > (cost of processing for judgement of 
convergence CC) Is established In a fluid simulation generally, this construction can be adopted if some small quantity 
of buffer equipment is prepared. Moreover, if the judgement of convergence is formed partially, only a small quantity 
of buffer will be enough. Putting the number of Iterations up to convergence asN, N x (CA + CC + a) can be shortened 
* to N x CA + a, a representing the communication cost 

{0456] Fig. 48 is a construction drawing of multi-processor showing an embodiment of the present invention realized 
for Improving the speed of simulation by cutting off the mutual dependence between the simulation and the processing 
for judgement of convergence. 

[0457] in the drawing, 3k are element processors for performing operations, 160 are convergence judging units for 
10 judging of convergence in each column based on the information of convergence flag output by the respective element 
processors 3k, 1 is the host computer for integrally controlling the element processors 3k and the convergence judging 
units 180, 1 04 are computation Information buses in direction of line for exchanging data for the operations between 
element processors 3k in the direction of line, 105 are computation Information buses in direction of column for ex- 
changing data for the operations between element processors 3k In the direction of column, 181 are convergence flag 
13 information buses for transmitting the convergence flag output by the respective element processors 3k to the conver- 
gence judging units 180, and 182 are convergence information buses for transmitting the convergence information of 
the respective convergence judging units 180 to the host computer. 
[0458] Next, explanation will be given on actions. 

[0459] The respective element processors 3k judge convergence of the computation for each step of repetition of 
20 respective computations or for a certain number of steps of respective computations against the results of Internal 
computation In themselves and, if convergence is reached, output the convergence judgement flag as 1 to the con- 
vergence flag Information buses 181, for example. The convergence judging units 180 collect the information of con- 
vergence judgement flag output from the respective element processors 3k and, when all the element processors 3k 
controlled by the respective convergence judging units 180 judged that convergence has been reached, output con- 
25 vergence Information individually to the convergence information buses 182. And, finally, the host computer 1 judges 
convergence of the whole part from the convergence information of ail the element processors 3k require all the con- 
vergence information buses 182 to terminate the operation. 

[0460] By constructing the system this way, the element processors 3k can execute operations In parallel and In- 
crease the speed of operations without being disturbed In respective operations. 

30 [0461] While in this embodiment the convergence judging units 160 divide the element processors arranged in the 
shape of lattice Into groups by Dnes and control the convergence Information of such groups, the same effects can be 
obtained regardless if this grouping is made by columns or at random as a matter of course. 
[0462] Next, explanation will be given on the thirty-second embodiment of the present invention. 
[0463] Fig. 49 is a construction drawing of multi-processor showing an embodiment realized for improving the speed 

35 of simulation by further cutting off the relation of mutual dependence between the simulation and the processing for 
judgement of convergence. 

[0464] In the drawing, 3k are element processors for performing operations, 180 and 180a are convergence judging 
units for judging the situation of convergence In each column based on the information of convergence flag output by 
the respective element processors 3k, 1 1s the host computer for integrally controlling the element processors 3k and 
*o the convergence judging units 1 80, 1 04 are computation information buses In direction of line for exchanging data for 
the operations between element processors 3k in the direction of line, 105 are computation Information buses in di- 
rection of column for exchanging data for the operations between element processors 3k in the direction of column, 
181 are convergence flag information buses for transmitting the convergence flag indicating the situation of conver- 
gence of the respective element processors 3k to the convergence judging units 180, and 162 are convergence Infor- 
matton buses for connecting between the convergence judging units 1 80 the host computer and transmitting the con- 
vergence information of the convergence judging units 180. 

[0465] In this construction, the convergence judging units 180 arrange the convergence information buses 182 in a 
way to make transmission of information only unldirectionalty between mutually adjacent convergence judging units. 
And, they start sending out convergence information when the computation in all the element processors 3k controlled 
so by the convergence judging unit a farthest from the host computer 1 converged. Other convergence judging units 180 
send out convergence information when both the convergence information from the adjacent convergence judging 
units 180 and their own judgement about convergence of the computation in all the element processors 3k controlled 
by themselves are ready. 

[0466] By arranging in this way, the host computer 1 can immediately judge the completion of computation at the 
<u point In time when the convergence Judging unit 1 80 closest to the host computer 1 sent out convergence Information 
to the host computer 1 . This makes it possible to further increase the speed of computation without disturbing the 
processing in the element processors 3k and the host computer 1 . 

[0487] While In this embodiment the convergence judging units 160 divide the element processors arranged in the 
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shape of lattice into groups by Ones end control the convergence information of such groups, the same effects can be 
obtained regardless If this grouping is made by columns or at random as a matter of course. Moreover, the same effects 
can be obtained even rf the connection between convergence judging units 180 is made regardless of their distance 
from the host computer 1. 
0 [0468] Next, explanation will be given on the thirty-third embodiment of the present invention. 

[0469] As explained In the thirty-second embodiment, transmitting convergence flag through convergence flag infor- 
mation buses constructed outside the element processors produces losses of communication time. Fig. 50 is a block 
diagram showing the construction of element processors according to the thirty-third embodiment realized for the pur- 
pose of shortening the transmission time of convergence flag. Moreover, Fig. 51 is a block diagram showing the con- 
to struction of a multi-processor composed of the element processors indicated in Fig. 50. 

[Q470] In Fig. 50, 3m Is an element processor, 5 is a processing unit for performing computation formed inside the 
element processor 3m, 6 Is a memory unit for storing the computation information necessary for the computation of 
the processing unit and provided inside the element processor 3m, 163 is a convergence Judging unit, provided inside 
the element processor 3m, forjudging the situation of convergence of th e processing unit 5 Inside the element processor 
15 3m it belongs to, the situation of convergence in the adjacent element compressor 3m and the situation of convergence 
of eO element compressors 3m, 104 are computation information buses in direction of line for exchanging data for the 
operations between the element processor 3m in the direction of line, 1 0S are com putation information buses in direction 
of column for exchanging data for the operations between the element processors 3m in the direction of column, 181 
are convergence flag Information buses for transmitting the convergence flag to the convergence Judging unit inside 
20 183 the respective element processors 3m by connecting the convergence Judging unit 183 Inside the respective ele- 
ment processors 3m in the form of a ring, 182 is a convergence information bus for transmitting the convergence 
Information of all the element processors 3m to the host computer 1 by connecting between at least one of the con- 
vergence judging units 1 83 of all the element processors 3m and the host computer 1 , 1 64 is an internal flag information 
bus for transmitting the convergence flag output by the processing unit 5 to the convergence Judging unit 180, and 106 
20 is an Internal flag information bus for exchanging data necessary for computation and the data of results of computation 
between the processing unit 5 and the memory 6. 

[0471] In Fig. 61, 1 is the host computer for integrally controlling the element processors 3m, 3k. The respective 
processing units 6 Judge convergence of the computation for each step of repetition of respective computations or for 
a certain number of steps of respective computations and, if convergence is reached, output the convergence Judge- 

30 ment flag as 1 to the convergence judging units 163 through the internal flag information bus 1 84, for example. More- 
over, the convergence Judging unit 183 of the respective element processors 3k are connected in the shape of a ring 
through convergence flag information bus. The Judgement of convergence is started from a predetermined specific 
element processor. Simultaneously as the convergence judging unit 163 of the element processor 3m receives an 
Internal flag Information bus indicating convergence from the processing unit 5 inside, it transmits the convergence 

35 judgement flag as 1 to the convergence judging units 1 83 of the adjacent element processor 3k through the convergence 
flag information bus, for example. Here, the convergence flag information bus 181 connected in the shape of a ring is 
provided with a function of transmitting Information in one direction. Said convergence judging unit 163 of the adjacent 
element processor 3k takes the logical product (AND) of the Internal Information flag (= 1) from the processing unit of 
the element processor 3k it belongs to and the convergence flag (= 1) from the convergence judging unit 183 of the 

40 adjacent element processor 3m and, If It is found true (= 1), former sends out convergence flag 1 to the convergence 
judging unit 163 of the adjacent element processor 3m through the convergence flag information bus 181. 
[0472] The respective convergence Judging units 183 connected in the shape of a ring perform the procedure of 
taking the logical product (AND) of the internal information flag (= 1) from the processing unit of the element processor 
3k they belong to and the convergence flag (= 1) from the convergence Judging unit 183 of the adjacent element 

4S processor 3k and, if It is found true (= 1), former sending out convergence flag 1 to the convergence Judging unit 183 
of the adjacent element processor 3k through the convergence flag Information bus, one after another In the order of 
connection of the convergence flag information buses 1 81 . Finally, at the point in time when the convergence flag (= 
1) is returned to the convergence judging unit 183 of the element processor 3m which first transmitted the convergence 
flag as 1 , (the convergence judging unit 183) Judges that ail the element processors 3k, 3m converged and send out 

60 convergence Information (= 1) to the host computer 1 through the convergence information bus 162 to complete the 
computation. 

[0473] By arranging this way, it Is possible to shorten the transmission time of information regarding convergence 
and thus achieve increase of speed of computation, because the internal flag information bus 181 is provided inside 
the respective element processors 3k, 3m. 
a* [0474] While in the above embodiment the Internal Information flag, the convergence flag Information and the con- 
vergence information were explained respectively as logical values of 0 or 1 , whatever value may be given to them if 
only Judgement of convergence or not can be made. Moreover, it is all right if a convergence information bus 182 is 
provided In one element processor 3m and there is no need of installing it in ail the element processors. 
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[0475] Moreover, any of the element processors may be selected as specific element processor 3m. 

[0476] Next, explanation will be given on the thirty-fourth embodiment of the present invention. 

[0477] Fig. 52 Is a block diagram showing the construction of a multi-processor indicating an embodiment for grouping 

all the element processors Into a plural number of groups and forming an Information transmission link of convergence 

8 flag Information bus for each group to realize parallel execution of judgement of convergence. 

[0478] In Fig. 52, 181 is a convergence flag information bus for grouping all the element processors into a plural 
number of groups end connecting the convergence judging units 183 inside the respective element processors 3k, 3m 
in the groups in the shape of a ring to transmit the convergence flag to the convergence judging units 183 inside the 
adjacent respective element processors 3k, 3m, and 182 is a convergence information bus for transmitting the con- 

10 vergence Information of the respective groups by connecting between the convergence judging unit 183 and the host 
computer 1 and provided In one specific element processor 3m in each group. 

[0479] In each group, the convergence judging unit 1 83 of a specific element processors 3m in which the convergence 
Information bus 162 Is provided receives Internal Information flag Indicating convergence from the processing unit 5 
Inside and, at the same time, transmits convergence flag 1 , for example, if convergence is reached, to the convergence 

18 judging units 183 of the adjacent element processors 3k through the convergence flag information bus. Here, the 
convergence flag information bus 181 connecting between the convergence judging units 183 of the element proces- 
sors in the group In the shape of a ring is provided with a function of transmitting information in one direction onfy. Said 
convergence judging unit 183 of the adjacent element processor 3k takes the logical product (AND) of the internal 
Information flag (= 1) from the processing unit of the element processor 3k it belongs to and the convergence flag (= 

20 1) from the convergence Judging unit 183 of the adjacent element processor 3m and, if it Is found true (= 1), further 
sends out convergence flag 1 to the convergence judging unit 183 of the adjacent element processor 3k through the 
convergence flag information bus 161. 

[0480] The respective convergence judging units 183 connected in the shape of a ring perform the procedure of 
taking the logical product (AND) of the internal information flag (= 1 ) from the processing unit of the element processor 

*9 3k they belong to and the convergence flag (=1) from the convergence judging unit 183 of the adjacent element 
processor 3k and, if It is found true (= 1), further sending out convergence flag 1 to the convergence judging unit 183 
of the adjacent element processor 3k through the convergence flag information bus, one after another in the order of 
connection of the convergence flag information buses 1 81 . Finally, at the point In time when the convergence flag (= 
1) Is returned to the convergence Judging unit 183 of the specific element processor 3m which first transmitted the 

30 convergence flag as 1 , (the convergence judging unit 1 83) judges that ell the element processors 3 in the group con- 
verged and send out convergence Information (= 1) to the host computer 1 through the convergence Information bus 
1 82. The host computer 1 judges that the computation of all element processors 3 converged at the point in time when 
it received convergence information (= 1) from all the groups and completes the computation. 
[0481] By constructing the system this way, the judgement of convergence for each group is formed in parallel and 

39 the time for judgement of convergence can be shortened. 

[0482] While in the above embodiment the convergence judging units 163 divide the element processors arranged 
In the shape of lattice Into groups and control the convergence information of such groups, the same effects can be 
obtained regardless if this grouping is made by columns or at random as a matter of course. Moreover, while in the 
above embodiment the internal Information flag, the convergence flag information and the convergence information 

<Q were explained respectively as logical values of 0 or 1 , whatever value may be given to them if only judgement of 
convergence or not can be made. Moreover, It is all right if a convergence information bus 182 is provided in one 
element processor 3m and there Is no need of installing it in all the element processors. 
[0483] Moreover, any of the element processors may be selected as specific element processor 3m. 
[0484] Next, explanation will be given on the thirty-filth embodiment of the present Invention. 

49 [0485] Fig. 63 is a block diagram showing the construction of element processors constituting a multi-processor for 
realizing reduction of the load on host computer at the time when judgement of convergence is made in parallel. More- 
over, Rg. 54 is a general construction drawing of the multi-processor composed of the element processors indicated 
In Fig. 53. 

[0488] In Rg. 53, 181 Is a convergence flag Information bus for grouping all the element processors 3k, 3p, 3q into 
80 a plural number of groups and connecting the convergence judging units 183 Inside the respective element processors 
in the groups in the shape of a ring to transmit the convergence flag to the convergence judging units 183 inside the 
adjacent respective element processors, 182 is a convergence information bus provided In at least one element proc- 
essor 3p, 3q in each group and connecting, in the shape of a ring, the convergence judging unit 183 of the element 
processors 3p, 3q in which it la provided, and 186 is an end information bus provided in at least one element processor 
88 3p of all groups and connecting between the convergence judging unit 183 of the element processor 3p and the host 
computer 1. 

[0487] When grouping is made as shown in Rg. 54, the convergence judging unit 183 of a specific element processors 
3p in the group in which the convergence Information bus 1 62 is provided starts judgement of convergence and, upon 
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receipt of internal information flag indicating convergence from the processing unit 5 inside, transmits convergence 
flag 1 , for example, if convergence Is reached, to the convergence judging units 1 83 of the adjacent element processors 
3k through the convergence flag information bus. Here, the convergence flag information bus 1 81 connecting between 
the convergence judging units 183 of the element processors 3k, 3p in the group in the shape of a ring is provided with 

s a function of transmitting Information in one direction only. 

[0488] Said convergence judging unit 183 of the adjacent element processor 3k takes the logical product (AND) of 
the internal Information flag (= 1 ) from the processing unit of the element processor 3k it belongs to and the convergence 
flag (= 1 ) from the convergence judging unit 183 of the adjacent element processors 3p, 3q and, if it is found true (= 
1) v further sends out convergence flag 1 to the convergence judging unit 183 of the adjacent element processor 3k 

10 through the convergence flag Information bus 181 . 

[0489] The respective convergence Judging units 183 connected In the shape of a ring perform the procedure of 
taking the logical product (AND) of the internal Information flag (= 1) from the processing unit of the element processor 
3k they belong to and the convergence flag (= 1) from the convergence judging unit 183 of the adjacent element 
processor 3k and, If it is found true (= 1), further sending out convergence flag 1 to the convergence judging unit 183 

15 of the adjacent element processor 3k through the convergence flag information bus, one after another in the order of 
connection of the convergence flag information buses 181. Finally, at the point in time when the convergence flag (= 
1) Is returned to the convergence Judging unit 183 of the specific element processors 3p, 3q which first transmitted the 
convergence flag as 1 , (the convergence judging unit 183) judges that the element processor 3p or 3q in the group 
converged. 

20 [0490] Moreover, when the convergence judging unit 1 83 of any specific element processor inside the element proc- 
essors 3p provided with an end Information bus judged that the element processors 3p, 3k to which it belongs con- 
verged, transmits convergence Information (= 1) through the convergence Information bus connected in the shape of 
a ring. Here, the convergence flag information bus 1 82 connecting between the convergence judging units 1 83 of the 
element processors 3p, 3q in the group In the shape of a ring is provided with a function of transmitting information in 

2ff one direction only. 

[0491] The element processor 3p of a predetermined group transmits (information) to the element processor 3q 
provided with a convergence Information bus 182 of the adjacent group. The convergence judging unit 163 of the 
element processor 3q provided with a convergence Information bus 182 of said adjacent group takes the logical product 
(AND) of the convergence information (= 1) of the group it belongs to and the convergence information (= 1) of the 

30 adjacent element group and, if it is found true (= 1), further sends out convergence information 1 to the convergence 
Judging unit 183 of the adjacent element processor 3q through the convergence flag Information bus 182. 
[0492] As explained above, the respective convergence judging units 183 connected in the shape of a ring perform 
the procedure of taking the logical product (AND) of the convergence information (= 1) of the group they belong to and 
the convergence Information (= 1 ) from the adjacent group and, if it is found true (= 1 ), further sending out convergence 

35 information 1 to the convergence judging unit 183 of the element processor 3q provided with a convergence Information 
bus 182 of the adjacent group through the convergence flag information bus 182, one after another in the order of 
connection of the convergence information buses 162. 

[0493] Finally, at the point In time when the convergence Information (= 1) Is returned to the convergence judging 
unit 183 of the specific element processor 3p which first transmitted the convergence flag as 1, (the convergence 
40 judging unit 163) judges that all the element processors 3k, 3p, 3q In the group converged and sends out end Information 
(= 1) to the host computer 1 through the end information bus 186. The host computer 1 judges, at the point in time 
when it received the end Information (= 1 ), judges that the computation of ail element processors 3k,3p,3q converged, 
and completes the computation. 

[0494] By constructing the system this way, it becomes possible to alleviate the load of the host computer at the time 
45 when the judgement of convergence is made in parallel, and to farther Increase the computation speed. 

[0495] While In this embodiment the convergence Judging units 160 divide the element processors arranged in the 
shape of lattice into groups by Ones and control the convergence information of such groups, the same effects can be 
obtained regardless if this grouping is made by columns or at random as a matter of course. Moreover, while in the 
above embodiment the Internal Information flag, the convergence flag Information and the convergence information 
60 were explained respectively as logical values of 0 or 1, whatever value may be given to them if only judgement of 
convergence or not can be made. Furthermore, It fe all right if a convergence information bus 182 is provided in one 
element processor 3p or 3q and there is no need of Installing it In all the element processors. In addition, ft is ail right 
if an end information bus 186 is provided In one element processor 3p and there is no need of installing it in aD the 
element processors. 

55 [0496] Moreover, any of the element processors Inside the group may be selected as specific element processor 3p 
and 3q, and any element processor representative of the group may be selected as specific element processor 3p. 
[0497] Next, explanation will be given on the thirty-sixth embodiment of the present invention. 
[0498] When computation Is executed on a multi-processor, there are cases, though quite rarely, where non stable 
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points (non converged points) patrol. This kind of so-called "oscillation phenomenon" must be detected. 

[0499] Detection of "oscillation phenomenon" can be made by judging convergence for the second time after once 

confirming convergence in all element processors, and this makes it possible to judge convergence with a fairly high 

accuracy. 

[0600] To be concrete, an end flag is transferred In order according to a predetermined route to form judgement of 
convergence of the entire computation. 

[0501] Fig. 55 Is a conceptual drawing showing an example of the route for transferring end flag. In Rg. 55, 190 and 
191 are lattice points obtained as a result of digitizing of a two-dimensional space. Moreover, the arrow mark line 192 
linking the respective lattice points indicates the route through which the end flag is transferred. 
[0502] The processing forjudging convergence at the respective lattice points will be explained hereafter. When 
convergence is confirmed for the first time after the arrival of the end flag from the adjacent lattice point, the end flag 
Is transferred to the subsequent lattice point. 

[0503] Fig. 56 indicates a flow chart for the respective lattice points to cany the end flag to the adjacent lattice point, 
except for the processing of the lattice point 1 90 which serves as the starting point of the end flag . 
[0504] As shown in Rg. 56, first, judgement Is made whether or not a message carrying the end flag arrived from 
the adjacent lattice point (step ST20), If a message carrying the end flag arrived from the adjacent lattice point, (the 
convergence judging unit) drives the convergence judging processing of the own lattice point, waits until the processing 
of the own lattice point reaches convergence (step ST21) and, at the point in lime where convergence is reached, 
transfers a message carrying the end flag to the next lattice point through the prescribed route (step ST22). 
[0505] Fig. 57 Indicates a flow chart for the sending out of the end flag at the lattice point serving as the starting point 
for the transfer of end flag. As shown in the chart, (the convergence judging unit) judges convergence of the processing 
of the own lattice point only at the lattice point serving as the starting point of the end flag (step ST26) and, If convergence 
la reached, transfers a message carrying the end flag to the next lattice point through the route indicated in Rg. 55 
(step ST26). 

[0506] After passing through all the lattice points as described above, the end flag is transferred to the host computer 
and convergence is confirmed. In this multi-processor of the thirty-fifth embodiment, after once convergence is con- 
firmed, the above process la repeated to ensure the judgement of convergence. 

[0507] As explained above, this embodiment is realized In such a way that the convergence judging processing of 
the own lattice point Is started with Input of a message carrying the end flag In the processing at respective lattice 
points and that a message carrying the convergence judging flag is output if convergence is reached in the processing 
of the own lattice point, and this makes it possible to judge convergence of the entire system correctly even In the 
presence of any unstable lattice points. 

[0506] Moreover, while in this embodiment explanation was given on a case where the processing for judgement of 
convergence for the processing of the own lattice at the lattice point serving as the starting point of the end flag is 
constantly driven, the same effects can also be obtained even if the processing for judgement of convergence is started 
from the host computer, as a matter of course. 

[0509] In this kind of Information processing system, the solution of equations is stored by being distributed to the 
storing means such as memory, etc. assigned to the respective lattice points. For that reason, it Is necessary to Integrate 
the results of computation stored separately, perform processing such as visualization, etc. and display them on a 
monitor. 

[0510] This embodiment indicates a method of integrating the solution of equations distributed to the respective 
lattice points on the host computer. 

[0511] In the thirty-sixth embodiment, explanation was given on the method of judgement of convergence about 
completion or not of the computation in the entire system. Namely, (the convergence judging unit) sends the end flag 
in order according to the prescribed route and, after folly patrolling of all lattice points, sends out a message indicating 
the convergence of the entire system to the host computer. The host computer performs processing for verifying the 
completion of computation by receiving said message. 

[0512] Fig. 56 is a flow chart showing the relationship between the processing for judgement of convergence at 
respective lattice points and sending out of solution to differential equations. Explanation will be given on the procedure 
of judgement of convergence and the sending out of solution to differential equations based on Fig. 66. First, the 
respective lattice points receive the end flag from the adjacent lattice point (step ST20). Next, they judge convergence 
of the processing of the own lattice points (step ST21) and, after confirmation of convergence, send out the end flag 
along the predetermined route (step ST22) while sending, at the same time, the results of computation at the own 
lattice points, i.e. the solution of differential equations to the host computer. 

[0613] Moreover, Rg. 68 Indicates a format of message packet RAC having the solution of differential equations to 
be sent out to the host computer. The field of the message packet is composed of the respective fields of left data (first 
operand) pa3, right data (second operand) pa4, destination PE number pal storing the element processor numbers 
to which to transfer the message packet and destination node numbers pa2 storing the address of the command to be 
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executed next 

[0514] The left data field, right data field, destination element processor number and destination node number in the 
message packet according to this embodiment are given as follows: The left data field is a value (functional value) at 
the lattice point concerned of the solution of differential equations. The right data field Is the identification number of 
Q the lattice point concerned. The destination element processor (PE) numbers are numbers assigned by the host com- 
puter to the element processors and the destination node numbers are results. The results are predetermined values 
and indicate that the message packet stores the results of computation at respective lattice points. 
[051 6] If such message packet Is received by the host computer, the system makes the following actions: 

10 (1) Checks the filed of the destination node number and, because the value is a result, confirms that the message 

packet concerned is the results of computation at the respective lattice points. 

(2) Reads out the number of lattice point stored in the right data field and the solution of a differential equation 
stored In the left data. 

(3) Determines the address by a predetermined method from the lattice point number and stores the value of the 
is solution of a differential equation at the lattice point concerned in storing means such as memory, etc. 

[0516] As explained so far, this embodiment is realized in such a way that (the respective lattice points) transfer the 
results of computation at the own lattice points (solution of differential equations) to the host computer as soon as they 
confirm convergence of the computation at respective lattice points, and has the effect of getting the data of final results 
20 ready on the host computer at the point in time when the completion of the solving of a differential equation under 
execution is confirmed. 

[0517] While the twenty-ninth embodiment premises assignment of the processing of one lattice point (program) to 
one element processor, the twenty-fifth to twenty-eighth embodiments premise that the processing of a plural number 
of lattice points is assigned to one same element processor and that the exchange of data between lattice points is 
25 made through the data memory inside one same element processor. The respective embodiments will be explained 
In detail hereafter. 

[0518] Explanation wlD be given on the thirty-seventh embodiment of the present invention. As another method of 
detecting circulation of unstable lattice points (non converged points) conceivable is a method of Judgement of con- 
vergence which consists In confirming convergence of all lattice points In multiplex ways by transferring the end flag 
30 through a plural number of routes passing through all the lattice points. 

[0619] To be concrete, judgement of convergence can be formed in multiplex ways by exchanging the end flag in 2 
directions of longitudinal and transversal directions. 

[0520] Fig. 59 indicates a conceptual drawing showing the route of transfer of the end flag in this embodiment 
[0521] In the drawing, 1 83, 194 and 195 indicate square lattices in two-dimensional space, the arrow mark indicated 
30 with solid line Indicates the route of transfer of the first end flag 196 and the arrow mark indicated with dotted line 
Indicates the route of transfer of the first end flag 197 respectively. In this embodiment, circulation of non converged 
points can be detected by making the routes of transfer of the first and the second end Hags orthogonally intersect 
each other. 

[0522] Fig. 60 Indicates a flow chart for the respective lattice points to cany the end flag to the adjacent lattice points, 
40 except for the processing of lattice points 193, 195 serving as starting points for the end flag. 

[0523] As shown in Fig. 60, judgement Is made first on whether or not a message of judgement of convergence 
arrived from the adjacent lattice point (step ST30). If a message carrying the end flag arrives from the adjacent lattice 
point, the lattice point drives the convergence judging processing of the own lattice point, waits until the processing of 
the own lattice point reaches convergence (step ST31) and, at the point in time where convergence is reached, judges 
45 if the end flag which arrived belongs to the first route of transfer or to the second route of transfer, and transfers a 
message carrying the end flag to the next lattice point through the prescribed desired route (step ST32). 
[0524] Rg. 61 indicates a flow chart for the sending out of end flag at the lattice point serving as starting point for 
the transfer of end flag. As shown in the drawing, the lattice point serving as starting point of end flag first judges 
convergence of the processing at the own lattice (step ST40) and, If convergence is reached, stores the flag showing 
60 that it belongs to the first or the second route of transfer in the convergence message (step ST41), and then sends out 
a message carrying the end flag to the next lattice point through the route Indicted in Rg. 59 (step ST32). 
[0525] At that time, the lattice point stores in the value of the end flag a value corresponding to either the first route 
or the second route to which the end flag belongs. On the basis of this value, other lattice points can determine the 
route of the arriving end flag. 

53 [0526] As explained above, two or more routes for transferring flag forjudging convergence are provided and this 
makes It possible to correctly judge convergence of the entire system even in the case where unstable lattice points 
exist. 

[0627] While In this embodiment a flag value was used tor Indicate on which of two routes the end flag Is transferred, 
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it is apparent that the same effects can be obtained also when programs for processing end flag are provided separately 
so that the program for judging convergence may be started for each transfer route of end flag at the time of arrival of 
the respective end flags. 

[0528] Next, Explanation will be given on the thirty-eighth embodiment of the present invention. Explanation will be 
8 given on the procedure of detecting unstable lattice points In the computation of convergence using multi-processor 
In the case where there is a possibility of existence of unstable lattice points. 

[0529] First, In the same way as in the trtirty^fifth embodiment, the lattice point transfers an end flag to the host 
computer through all the lattice points along the predetermined route. Next, the host computer transmits a message 
for stopping the processing of alt lattice points and, as a result, stops all processing operations. After that, it detects 
10 patrolling of unstable lattice points by checking for convergence again. Accurate judgement of convergence can be 
realized also with this method. 

[0530] Fig. 62 is a flow chart indicating the content of processing relating to the judgement of convergence of the 
respective lattice points In this embodiment. 

[0631] The processing relating to the Judgement of convergence of the respective lattice points will be explained 

15 based on Fig. 62. In this embodiment, a memory or a register for storing the results of judgement of convergence is used. 
[0532] In step ST50, the lattice point judges if a message carrying the end flag arrived from the adjacent lattice point 
or not. if a message carrying the end flag arrived from the adjacent lattice point, the lattice point drives the convergence 
judging processing of the own lattice point, wafts until the processing of the own lattice point reaches convergence 
(step ST51) and, at the point in time where convergence is reached, transfers a message carrying the end flag to the 

20 next lattice point through the predetermined route as shown in Fig. 55 (step ST53). At that time, the lattice point clears 
said memory (step ST52) and starts the program for processing judgement of convergence (step ST54). The program 
for processing Judgement of convergence judges if the processing of the own lattice point is in the state of non con- 
vergence or not and, when it is judged as In the state of non convergence, writes a flag indicating the state of non 
convergence of own lattice point in said memory. 

25 [0533] Moreover, while the processing for judging convergence at the lattice point serving as starting point of a 
message carrying the end flag is, as shown In Fig. 36, basically the same as the processing of the lattice point serving 
as starting point of a message carrying the end flag in the thirty-sixth embodiment, the lattice point dears said memory 
(step ST6 1 ) and drives the program for processing judgement of convergence (step ST63), in addition to that process- 
ing. The program for processing judgement of convergence judges if the processing of the own lattice point is in the 

30 state of non convergence or not and, when it is judged as in the state of non convergence, writes a flag indicating the 
state of non convergence of own lattice point In said memory. 

[0534] After that, a message carrying the end flag is transferred to the host computer. Upon receipt of the message 
carrying the end flag, the host computer transmits a message for stopping the processing of predetermined lattice 
points. 

35 [0535] The lattice point which received said message for stopping the processing stops the processing and, at the 
same time, transmits the message for stopping the processing to the next lattice point along the predetermined route. 
The lattice point which received the message for stopping the processing performs the same stopping operation. After 
the processing of all lattice points is over, the host computer reads out said memory assigned to the respective lattice 
points end, in the case where all results of reading are zero, it means that the object computation converged. If 8ny of 
40 said results of reading Is found to be other than zero, the above-mentioned processing wfll be repeated. 

[0536] Fig. 64 indicates a flow chart for the lattice points to stop processing. The host computer transmits a message 
for stopping processing at a predetermined lattice point only. The predetermined lattice points judge arrival or not of 
the message for stopping processing (step ST70). They stop the processing if the message for stopping processing 
arrives (step ST71 ). While in this embodiment explanation was given on a case where a message for stopping process- 
es ing at a predetermined lattice point only is transmitted from the host computer, the same effects can also be obtained 
even if this message Is sent to all or a plural number of lattice points simultaneously or in time series. 
[0537] Moreover, while in this embodiment explanation was given on a case where processing at lattice point is 
stopped with a message, the same effects can also be obtained, when the element processors constituting the multi- 
processor according to the present invention are provided with a mechanism for receiving interrupt signals, even if the 
60 processing is stopped with an interruption made by the host computer to said element processors. 

[0538] Furthermore, while in this embodiment explanation was given on a case where a memory for storing a flag 
Indicating non converged state of lattice point is assigned to each lattice point, It is apparent that the same effects can 
also be obtained if only at least one memory exists in the system. Moreover, if the element processors constituting 
said murti-processors are of a construction having a register, the same effects can be obtained also with the use of the 
& register. 

[0538] Next, explanation will be given on the thirty-ninth embodiment of the present invention. 

[0540] In the thirty-sixth to thirty-eighth embodiments, explanation was given on a case where, a message carrying 

a flag Indicating that the processing of the own lattice point reached a state of convergence Is transferred along the 
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predetermined route, and that message is transmitted to the host computer and then the host computer starts the 
processing for detecting "oscillation phenomenon" explained In the thirty-sixth embodiment. 
[0541] However, though quite rarely, there may be cases where the state of convergence is not reached for a long 
time depending on the boundary conditions, etc. 

[0542] Explanation wfll be given hereafter on the method of judgement of convergence of the Information processing 
system workable even in a case where the system gets in a state of constant instability as mentioned above. 
[054^] In case said convergence message is not transferred to the host computer for a certain period of time after 
the processing time b measured with the timer on the host computer or the watch-dog timer installed on the system, 
etc., the processing program of the respective lattice points performs the following processings (1), (2): 

(1) Relax the standards of Judgement of convergence at respective lattice points than usual. 

(2) Attach, to the message to be transferred, a flag Indicating that the judgement was made by using relaxed 
standards of judgement. 

[0544] By performing the processings (1), (2), it becomes possible to prevent the system from getting into a state of 
constant instability. 

[0545] Next, an embodiment for realizing the processings (1), (2) will be explained based on the thirty-sixth embod- 
iment. 

[0545] Fig. 65, Fig. 66 Indicate flow charts of processing for judgement of convergence at respective lattice points 
necessary for realizing the processings (1), (2). Fig. 65, Fig. 66 correspond to Fig. 56, Fig. 57 explained in the thirty- 
sixth embodiment respectively. 

[0547] Fig. 65 Indicates a flow chart showing the processing for judgement of convergence at lattice points other 
than the lattice point which serves as starting point of the flag. 

[0546] As shown in the drawing, judgement is made first on whether or not a message carrying sn end flag arrived 
from the adjacent lattice point (step ST80). If that message arrived, the lattice point judges if a flag indicating relaxation 
of the standards of judgement is attached to the message or not (step ST81). 

[0649] If this message has no flag indicating relaxation of the standards of judgement, a convergence judging de- 
termine to performing according to the normal standards of judgement (step ST62) and the judgement of convergence 
or not Is made (step ST83). If convergence Is reached, the lattice point sends out a convergence message to the 
adjacent lattice point (step ST84). 

[0650] If, after the convergence message Is input from the adjacent lattice point, this message is found to have a 
flag Indicating that the Judgement was made by using relaxed standards of Judgement, the lattice point forms Judgement 
of convergence of the own lattice point by using the second standards set in advance (steps ST85, ST86), and sends 
out the message to the next lattice point by attaching a flag indicating relaxation of the standards of judgement to it 
(step ST87). 

[0651] Rg. 66 indicates a flow chart showing the processing for judgement of convergence at the lattice point which 
serves as starting point of the flag. Rg. 68 indicates the procedure of a case In which the lattice point forms Judgement 
of convergence of the own lattice point by using the second standards set In advance if It receives a message indicating 
change of the standards of Judgement of convergence from the host computer, and sends out the message to the next 
lattice point by attaching a flag Indicating relaxation of the standards of Judgement to It The lattice point judges arrival 
or not of the Instruction message for changing the standards of Judgement of convergence from the host computer 
(step STB0). If the instruction message arrived, the lattice point changes the standards in step ST91 . and judges rf the 
processing of the own lattice point converged with the given standards or not (step ST92). If the processing of the own 
lattice point converged with the given standards, judgement is made on if there has been a change of the standards 
of judgement of convergence or not (step ST93). The lattice point sends out the convergence message as it is to the 
adjacent lattice point (step ST95) If there was no change of the standards of judgement of convergence, but sends out 
the convergence message to the adjacent lattice point (step STBS) by attaching a flag indicating relaxation of the 
standards of Judgement to it (step ST94) rf there has been a change of the standards of judgement of convergence. 
[0552] In both the thirty-seventh and thirty-eighth embodiments, it Is apparent that the function for realizing said 
processings (1), (2) can be supplemented in the same way as in the thirty-ninth embodiment 
[0553] Next, explanation will be given on the fortieth embodiment of the present invention. This embodiment is char- 
acterized with the method of Judgement of convergence which consists in defecting unstable lattice points in the com- 
putation of convergence using a multi-processor. 

[0554] In this embodiment, the system is provided with a convergence judging program convergence judging unit 
forjudging convergence or not of the computation at respective lattice points and a register (CREG) or memory for 
storing the state of convergence. CREG can be cleared to zero from outside at optional timing and is set for 1 when it 
Is judged that the computation of the lattice point concerned has not converged yet. 

[0555] The judgement of convergence of the entire system Is made by the procedure Indicated below. Rg. 67 Iridl- 



57 



EP 0 696 001 B1 



cates a flow chart corresponding to that procedure. 

(1) Clears the CREG registers corresponding to all the lattice points to zero (step ST100). 

(2) Waits for a time sufficient for terminating one time of processing of afl the lattice points (step ST101). 
8 (3) Reads the said register of all the lattice points to make sure that It remains 0 (step ST102). 

(4) Repeats the processings (1) to (3) in case 1 is read out even at only one lattice point Terminates the processing 
if the value of said register for all the lattice points Is 0 (step ST1 03). 

[0556] While in this embodiment 1 is written in said CREQ register in the case where the convergence judging pro- 
10 gram judged that convergence Is not yet reached, this processing may be incorporated in the program from the begin- 
ning, or may be made by starting said Judging program by rewriting the program once after the convergence of com- 
putation of the lattice point concerned, for operating said CREG register. 

[0557] Moreover, while in this embodiment explanation was given on a case where a register is assigned to each 
lattice point as storing means for storing the state of convergence of the computation of lattice points, it is apparent 

1Q that the same effects can be obtained also by using memory as raid storing means. In addition, while in this embodiment 
explanation was given on a case where either a register or a memory element is assigned to each lattice point, no less 
than one piece of such register or memory will be enough in the entire system. Needless to say, assigning no less than 
one piece of said storing means to each element processor will be more effective in the case where the convergence 
judgement method according to the present invention on a system constructed with a multi-processor. 

20 [0556] As explained above, this embodiment, which is provided with at least one storing means for storing the state 
of non convergence, means for clearing all such storing means to zero and means for reading the content stored in 
such storing means for making sure that their values are all zero, is capable of judging convergence without error even 
in the case of presence of some non converged points. 

[0559] Moreover, the processing program of each lattice point writes "1" in the register (CREG) when the lattice point 
20 concerned got in the state of non convergence in this embodiment. With the addition of "1", it becomes possible to 
know the number of lattice points which got in the state of non convergence, by finally reading out the value of CREG . 
In th is way, if the processing program tor each lattice point is constructed in a way to add "1 " to the content of CREG 
In case the lattice point concerned got In the state of non convergence, It has an effect of making It possible to shorten 
the processing time by making modification of standards of Judgement of convergence, adjustment of coarseness of 
30 lattice points, etc. based on the final value of CREG. 

[0560] The Inventions of the thirty-first to thirty-ninth embodiments not only are effective for an ordinary multi-proc- 
essor having a plural number of processors as a matter of course but also have an effect of making it possible to 
operate the element processors independently even during judgement of convergence and improve the accuracy of 
the solution by combined use with an Information processing system allowing independent operation of element proo- 
fs essors such as that of the first embodiment, etc. 

[0581] Next, explanation will be given on the forty-first embodiment of the present invention. 
[0552] Fig. 71 is a block diagram showing the network construction composed of an I/O controller and data memories 
for holding data communication between mutually adjacent element processors according to the forty-first embodiment 
of the present invention. In Rg. 71, 3t is an element processor not having any data memory or I/O controller for com- 
40 munlcation with data memory inside, 300 are data memories arranged on the network, 301 is an I/O controller for 
determining access route between element processor 3t and a plural number of data memories 300 and controlling 
data communication between element processor and data memories 300. 302 are data buses connecting between 
element processor 3t and I/O controller for data communication , and 303 are connections between element processors 
connecting between element processor 3t and data memories 300. 
4* [0503] The I/O controller 301 is connected with 4 data memories 300 here, and the respective data memories 300 
are connected to other I/O controllers which are connected to the adjacent element processors of the element processor 
3t However, the number of data memories 300 connected to the I/O controller 301 is not limited to 4. The number of 
data memories 300 connected to the respective I/O controllers 301 is determined depending on the state of network, 
and the state of network is determined depending on the mode of respective analyses. 
50 [0564] The element processor 3t sends a demand for access to memory to the I/O controller 301 through the data 
bus 302. The demand for access to memory contains a data of direction of communication showing in which of the 4 
directions to have access to the data memory 300 and a description of the address number of the memory space of 
the specified data memory 300. The I/O co ntrolle 30 1 switches the direction of communication according to this demand 
for access to memory to enable access to the specified data memory 300 among those in the 4 directions. And, the 
55 data of the address No. described In the memory access demand passes through the connection 303 between element 
processors, the I/O controller 301 and the data bus 302 to be given to the element processor 3t This data given to the 
element processor 3t is a data which was written in advance by an adjacent element processor, for example, and is a 
data of equal value as that given by direct communication with the adjacent element processor. 
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[0565] Moreover, needless to say, when having access to e plural number of data arranged regularly on the memory. 
It Is possible to realize continuous access to those data by sending the forefront address and the addressing mode of 
such data from the element processor 3t to the I/O controller 301 . 

[0566] Aa explained above, by constructing a network with the I/O controller 301 placed among data memories 300, 
* It becomes possible for the adjacent element processor to have access at high speed to the data memory 300 In which 
data Is written, even in a case where the I/O controller 301 is not mounted inside the element processor 3t. 
[0567] In the network construction indicated In Fig. 71, in which the element processor 3t and the I/O controller 201 
were connected directly to each other through the data bus 302, it is necessary to build the system actions in a way 
to avoid collision of demands for reference/renewal because of the direct connection. In case of collision of demands 
10 for reference/renewal to one same memory, the processing is interrupted, leading to an increase of access time. 

[0568] The network construction Indicated in Fig. 72 Is provided with a buffer mechanism for temporarily storing 
memory access demands sent from the element processor 3t to the I/O controller 301. In Fig. 72, 304 is a buffer 
mechanism for temporarily storing memory access demands sent from the element processor 3t to the I/O controller 
301, and other parts of the same codes with those in Rg. 71 are parts corresponding to those of the same codes in 
16 Fig. 71. 

[0569] In a multi-processor constructed as shown in Fig. 72, one data memory 300 is shared by mutually adjacent 
element processors. There is a possibility that 2 mutually adjacent element processors have access to one same 
memory simultaneously. Trie buffer mechanism 304 temporarily stores requests for reference/renewal to data memory. 
If any request for reference/renewal Is stored in the buffer mechanism 304, the element processor 3t can execute 
20 processing regardless of acquisition or not of data from the data memory. Even In the case of collision of requests for 
reference/renewal to one same memory address, speedier access to memory becomes possible without any interrup- 
tion of the processing. 

[0570] Fig. 73 indicates a lattice type network topology most suitable to thermal hydraulic analysis formed by con- 
necting a plural number of element units indicated in Rg. 71. In the two-dimensional lattice type network explained in 

29 the eighth to eleventh embodiments, element processors are arranged at points of intersection of lattices. As shown 
In Rg. 73, in the case where the I/O controller 301 is placed outside the element processor 3t, the I/O controller 301 
is disposed at the position of intersection of lattices. The element processor 3t corresponds to the I/O controller 301 
at 1:1 and the I/O controller 301 and the element processor 3t are connected to each other through the data bus 302, 
end the I/O controller 301 Is also connected with 4 data memories 300. 

30 [0571] In this way, by connecting various devices such as element processor 3t data memories 300, etc. centering 
on the I/O controller 301 , It becomes possible to construct a two-dimensional lattice type network and execute memory 
access between mutually adjacent element processors at high speed. 

[0572] Naturally, it is also possible to construct a network based on a one-dimensional pipeline topology by using 
the I/O controller 301 Rg. 74 Is a block diagram showing part of the network of a multi-processor system constructed 
36 in the shape of a one-dimensional pipeline topology. In the case of a one-dimensional pipeline topology, the number 
of data memories 300 connected to the I/O controller 301 is 2. 

[0573] Needless to say, by thus constructing a network through the I/O controller 301 , it becomes possible to speed 
up the access to data memories 300 by the element processor 3t also In a one-dimenslonal pipeline type network In 
the same way as with a two-dimensional lattice type network. 
40 [0574] Moreover, a similar construction Is also possible in a case where element processors are arranged three- 
dlmenslonally. As shown In Rg. 75, in a three-dimensional construction, 6 data memories 300 are connected to one M 
0 controller. For example, a plural number of lattice type network topologies indicated in Fig. 73 may be placed one 
upon another Into a three-dimensional construction. In Fig. 75, the I/O controllers 301a and 301b belong to lattice type 
network topologies on different planes. Those two t/O controllers 301a and 301b share one data memory 300. In this 
way, by connecting various devices such as element processor 3t, data memories 300, etc centering on the I/O con- 
trollers 301a and 301b, It becomes possible to construct a three-dimensional lattice type network and execute memory 
access between mutually adjacent element processors at high speed. 

[0575] Furthermore, while this construction is applicable in optional networks, it is also possible to have data com- 
munication with a remote element processor 31, located In the neighborhood, otherthan the adjacent element processor 
so 3t in a lattice type network topology as shown in Rg. 73, for example. For example, by providing an area in which to 
write data of a prescribed element processor in the data memory and copying that area with the I/O controller 301, it 
becomes possible to obtain data of an element processor located at a certain distance from the adjacent element 
processor 31. 

[0576] The construction may also be made in such a way that the element processor outputs a request for reference/ 
63 renewal to a plural number of t/O controllers. Fig. 76 Indicates a construction realized by applying that Idea to the two- 
dimensional lattice type topology Indicated in Fig. 73. As shown in Rg. 76, the element processors 3tc, 3td can output 
a request for reference/renewal to the I/O controllers 301c arranged in 4 directions. For example, when holding com- 
munication between element processors 3tc, they output a request for reference/renewal to two mutually adjacent M 
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0 controllers 301c sharing one data memory 300. This makes it possible to hold communication between element 
processors 3tc by skipping one adjacent element processor 3td. For example, this makes it possible to make selective 
use of both coarse node coupling performing computation by using element processors 3tc only and fine node coupling 
performing computation by using element processors 3tc, 3td. 

[0677] Still more, the element processors 3t may not only be connected to the I/O controllers 301 arranged around 
them through data bus to output requests for reference/renewal but also be connected to the I/O controllers 301 at 
more distant positions through date bus depending on the object of analysis, accuracy, etc., as a matter of course. 
[0578] Next, explanation will be given on the forty-second embodiment of the present invention. 
[0579] In the thirtieth to thirty-fourth embodiments, the convergence judgement flags (1 orO) output by the respective 
element processors 3k, 3m, 3p, 3q were logically multiplied one by one by means of the convergence judging unit 180 
or the logical circuit mounted Inside the convergence judging unit 1 63, and eventually transferred to the host computer 

1 as convergence judgement information. By the way, in the case of a small-scale multi-processor system in which the 
number of element processors connected is tens of unit or so, it Is possible to take the logical product of the convergence 
judgement bits at sufficiently high speed even with the construction indicated In the thirtieth to thirty-fourth embodiments. 
However, in the case of a large-scale multi-processor system in which the number of element processors connected 
amounts to as many as hundreds of unit, the total time required for the completion of logical products in all the con- 
vergence judging units 180 or the convergence judging unit 183 and the transfer to host computer 1 of the results of 
judgement of convergence becomes enormous, and this reduces the effect of having a special hardware constructed 
by beating the convergence judging units 160, for example, outside the element processor 3k, etc 

[05801 Fig. 77 is a conceptual drawing indicating an outline of the convergence Judging system according to the forty- 
second embodiment of the present invention. In Fig. 77, 3 are element processors, 305 are registers for convergence 
Judgement bits for temporarily storing convergence judgement bits, 306 are convergence judgement bit wires for trans- 
mitting convergence judgement bits output from the element processors 3 by connecting between the element proc- 
essors and the registers for convergence judgement bits 305, 308 is a convergence information signal wire for trans- 
mitting convergence Information to the host computer, 309 are N type MOS transistors having a source connected to 
the grounding potential point 307 fixed at a grounding potential ofVss, a gate connected to the registers for convergence 
judgement bits 305 and a drain connected to the convergence information signal wfre 308, and 310 Is a resistor having 
an end connected to the supply potential point 31 1 and the other end connected to the convergence Information signal 
wire 308. Trie wired OR is composed of N type MOS transistors 309, convergence information signal wire 308 and 
resistor 310. 

[0581] At least 1 bit of convergence Judgement bits output respectively from the respective element processors 3 is 
temporarily preserved on the register 305 for convergence judgement bits through the convergence judgement bits 
wire 306. The convergence Judgement bits stored in the register 305 for convergence judgement bits are submitted to 
logical multiplication by wired OR and are reflected In at least 1 bit of the results of judgement of convergence output 
on the convergence information signal wire 308. 

[0582] Now, definition will be given as a state of convergence for a state in which the convergence judgement bits 
output from the respective element processors 3 are "0 n , and as a state of non convergence If they are "1\ In the case 
where all of the respective element processors 3 output "0", namely a state of convergence, as convergence Judgement 
bits, the N type MOS transistors 309 all get in a state of non continuity, the convergence information signal wire 308 
outputs a supply potential of VDD through the resistor 310 and the host computer can recognize that the entire system 
has reached a state of convergence. 

[0583] Moreover, in the case where at least one of the element processors 3 outputs "1° as convergence judgement 
bit (namely a state of non convergence), it is possible to make the convergence information signal wire 308 output a 
potential which can be considered digitally as supply potential of VDD, by setting the resistance value of the resistor 
310 sufficiently larger compared with the resistance value at the time of continuity of the N type MOS transistors 309, 
because the corresponding N type MOS transistors 309 remain in the state of continuity. With the convergence Infor- 
mation signal wire 308 kept at a potential which can be considered digitally as VSS, the host computer 1 can recognize 
that the system as a whole has not yet reached a state of convergence. 

[0584] Needless to say, In this embodiment definition is given as a state of convergence when the logical values of 
the convergence judgement bits output from the respective element processors 3 are "1", and as a state of non con- 
vergence if they are "0*, and a circuit making similar actions can be constructed by changing the N type MOS transistors 
Into P type MOS transistors. 

[0585] By constructing the convergence judging unit as described above, It becomes possible to continuously obtain 
the logical value of the convergence judgement bits preserved on ail the register 305 for convergence Judgement bits 
In a short time of several micro seconds or so and thus shorten the convergence Judging time while forming Judgement 
of convergence independently of the processing for operations. 

[0586] Next, explanation will be given on the forty-third embodiment of the present invention. 

[0587] In the method of paralleling of thermal hydrauDc value analysis by the method of division of area, the compu- 
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tation arse is divided into areas in a number at least equal to that of element processors 3 end the computation of the 
respective area is assigned to the respective element processors 3. In the case where the communication system 
between element processors is set for low-speed remote access of message passing type as rt is generally practiced, 
It Is possible to realize parallel operations to some extent by averaging the calculation load by maintaining the number 

* of lattice points assigned to the element processors 3 at about the same level but, when some unstable lattice points 
exist In the divided areas, a lot of repetitive operations are required before a solution of convergence is obtained 
because of the differences in the volume of processing of the element processors 3, and this produces an increase of 
the volume of processing of the element processors 3. For that reason, the multi-processor system is constructed by 
asynchronous communication system completely eliminating any latency due to waiting for synchronization of com- 

10 munication, and the number of lattice points to be mounted is changed for each element processor 3, to solve the 
above-mentioned problem. 

[0 583] Fig . 78 indicates the form of division of computation area including unstable lattice points for analysis. In Fig . 
78, 31 3 Is the entire computation area used for the analysis, 3 14 are divided areas to be taken charge by the respective 
element processors 3, 315 are ordinary lattice points, and 316 are unstable lattice points for analysis where some 

16 unstable factor such as chronological change of physical quantity, etc. exists. Factors producing instability of analysis 
include, for example, the points on which the speed of an object changes rapidly, the points on which the temperature 
changes rapidly, the points in vortex, the points on which a physical value changes in stationary, the points neighboring 
to the moving object or moving boundary, the points in the turning fluid, the singular points on the comer, the points of 
heat generation, the points of inflow & effluence of fluid, etc., points of movement of substance, etc. and points in their 

20 neighborhood. 

[0589] Next, explanation will be given on actions. 

[0590] In the element processors 3 taking charge of the divided areas 314 including unstable lattice points for analysis 
316, the number of times of repetition up to obtainment of a spatial solution of convergence increases because the 
instability of the unstable lattice points for analysis 316 has Influence on all lattice points in the divided areas 314. For 

25 that reason, the number of lattice points In the divided areas Including unstable lattice points for analysis 316 is made 
smaller than the number of lattice points in other divided areas not including any unstable lattice points for analysis 
316. This makes it possible to make up for the Increase of processing volume due to increase of number of times of 
repetition by reducing the number of lattice points. Moreover, by adopting asynchronous communication system, it 
becomes possible to execute the computation of divided areas 314 including unstable lattice points for analysis 316 

30 prior to the execution of stable divided arm 31 4 not includin g any unstable lattice points for analysis 31 6. As explained 
above, by combined use of asynchronous communication system and uneven distribution of lattice points, it becomes 
possible to increase the speed of operations for convergence even in a case where unstable areas are included in the 
computation area. 

[0591] Next, explanation will be given on the forty-fourth embodiment of the present Invention. 
[0592] The computation circuit used in the element processors 3 explained in the first to fifth embodiments is mostly 
of synchronous system based on a clock provided In the element processors 3. This is because, at the present level 
of technology, the relation of reaction time of elements In the circuit (element delay) > delay due to transmission of 
Information In the chips (wiring delay) Is established. 

[0593] However, the rapid progress of super high-speed element technology in recent years has enabled actions 
with an element delay of 10 ps with silicon semiconductor element, 6 ps with compound semiconductor element and 
1 ps level with Josephson element Moreover, in line with the progress of VLSI technology, the chip dimensions ex- 
panded to 15 mm x 15 mm, and the number of mountable elements is going to increase up to 3 million pieces with 
logical LSI. if such plco-second LSI Is realized, while the time up to outputting of the results of processing for an input 
In the element (element delay) is reduced, the time required for the information to pass through the wiring in the chips 

45 (wiring delay) which has so far been not questioned becomes non negligible. If a chip is constituted with an element 
having an element delay of 1 plco-second, the action in a single clock Is guaranteed only up to a chip surface area of 
0.3 mm x 0.3 mm max. for the entire chip. So, by making a circuit design based on an In-processor asynchronous 
system which does not synchronize actions in the chip with e clock, it becomes possible to sharply improve the process- 
ing capacity of the element processors. Moreover, asynchronous system design provides such advantages as (J) to 

60 <S> * etc. to be described later and makes it possible to improve robustness and high-speed operation of the element 
processors 3 and eventually achieve high-speed thermal hydraulic analysis. 

(D Enables normal actions regardless of the amount of element delay according to the causal relations of transition 
of signals, achieving high-speed operation. 
03 @ Achieves high reliability by eliminating timing fluctuations even against external disturbances such as voltage 
fluctuations, temperature & environmental changes, eta of the information processing system. 
<§) Facilitates the design work because no timing adjustment is required. 

® Does not require any electric power for generating clock and therefore provides an information processing 
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system of low power consumption. 

[0594] Asynchronous circuit having such characteristics can be used for the circuit in the element processors 3. So, 
explanation will be given hereafter on the basic construction of an asynchronous circuit 

o [0595] An asynchronous system Is an aggregate of functional modules constructed with asynchronous circuits mu- 
tually exchanging signals. If any request for processing is made from a certain functional module A to another functional 
module B, the actions to one request for processing between the functional modules A and B ere completed with an 
exchange of 2 signals or a demand signal named "demand" indicating the request for processing between module A 
to module B and a response signal named "response" indicating notification of end of processing from module B to 

10 module A. However, in an asynchronous system, there is no need of any device for notifying the timing of arrival of 
those demand signal and response signal because no synchronizing clock exists. First, for control signals, if definition 
is given as "1" for "valid" and °0" for "invalid", the transition itself of either 0 1 or 1 -> 0 expresses the timing of 
demand or response. What is to be questioned is a case where the exchanged signals are data signals. Usually, data 
signals are of single-wire system and can only be given In binary expressions, and there is no way of describing the 

io timing of demand and response. As a solution to this problem, the exchange of data signals between a plural number 
of functional modules constructed with asynchronous circuits is made by using two-wire type data expressing 1-bit 
information with 2 signal wires. As two-wire type data expressions, there are 2 -wire 2-phase system, 2-wire transition 
system, parity alternating system, etc. Among those systems, the 2-wire 2-phase system is highly flexible and enables 
simplification of circuit construction. So explanation will be given hereafter on a case where the data signal expression 

20 of 2-wire 2-phase system is used. 

[0596] In the data expression of 2-wire 2-phase system, a-bit data D is expressed as in Formula 40 with two-wire 
signals (d1,d0). 

^ D = 0 ^(d1,d0) = (0,1) 

D = 1 <->(d1,d0) = (1,0) 
D = state of spacer <->(d1,d0) = (0,0) (40) 

30 

[0597] For example, to transfer "1", the system performs a series of actions of changing the pause phase (0, 0) to 
(1 , 0) and, after the end of processing, resetting it to a state of spacer (0, 0) to get ready for the next data transfer. 
[0596] Next, data transfer route based on this data expression of 2-wire 2-phase system wiD be examined hereafter. 
[0599] Fig. 79 Is a block diagram showing a model of data transfer route between registers based on the data ex- 

35 press! on of 2-wire 2-phase system. In Fig. 79, 317 are control signal buses for transferring control signals, 318 is a 
2-wire 2-phase control module for resetting 2-wire 2-phase signals after the end of processing for demand & response, 
319 are registers for temporarily storing signals, 320 is a two-wire logical circuit, and 321 are two-wire type data buses. 
It is to be noted, however, that the two-wire logical circuit 320 In the drawing is composed of a basic logical element 
constructed based on two-wire logic. 

40 [0600] At the receipt of information of end of processing from the upper unit, the 2-wlre 2-phase control module 318 
generates a demand (transition 0 1). At this demand, two-wire data signal is read out from the transferring register 
319. The data read out from the transferring register 319 Is given to the two-wire logical circuit 320 through the two- 
wire type data bus 321 and Is treated by the two-wire logical circuit 320. Moreover, the data treated by the two-wire 
logical circuit 320 is written in the destination register 31 9 through the two-wire type data bus 32 1 . After that, the 2-wire 

45 2-phase control module 31 8 which received the response (transition 0-> 1) generated by the register 319 transfers 
information of end of processing to the tower unit Following the information of end of processing from the upper unit, 
a command for extinction of demand (1 -» 0) is transferred from the upper unit, passes through the transferring register 
31 9 to the logical circuit 320 up to the destination register 31 9 to put all the two-wire signals there in the state of spacer, 
and to eventually extinguish the response (transition 1 -> 0) to terminate the operation. 

so [0601] The two-wire logical circuit 320 in Fig. 79 may be a circuit of whatever construction if only It is a circuit con- 
structed by connecting basic logical elements such as two-wire type AND gate, OR gate, NOT gate, etc For example, 
by realizing the entire processing unit 6 indicated in either of the first embodiment to the fifth embodiment with two- 
wire logical circuits 320, it becomes possible to realize high-speed operation in the element processors 3 even in case 
the wiring delay gets larger than the element delay. 

55 [0602] Needless to say, asynchronous processing not using any clock can be applied to specific operation columns 
only and the same effects can be obtained also in this case. 
[0603] It Is also possible to use said different embodiments In combination. 

[0604] While the Invention has been shown and described In detail, the foregoing description is In all aspects illus- 



62 



EP0 696 001 B1 



tret ive and not restrictive. It is therefore understood that numerous modifications and variations can be devised without 
departing the scope of the invention as defined by the claims. 



b Claims 

1. An information processing system (2), comprising: 

a plural number of element processors (3) respectively having processing units (5) for performing computing 
10 operations and associated data memories (8; 76; 300) for storing data regarding said computing operations 

for performing computation by assigning, regarding at least one governing equation of a field which can be 
described in at least one prescribed equation by using said plural number of element processors (3), a plural 
number of lattice points obtained by spatially dividing said field to said element processors (3); and 
a plural number of Individual data buses (7) for having mutual communications of said data between adjacent 
™ ones of said element processors (3), wherein 

said computation is performed by operating said element processors (3) asynchronously and independently 
by using said data transmitted through said data buses (7); 

said information processing system (2) being characterized In that each of said element processors (3) di- 
rectly writes to or reads from said associated data memories (8; 76; 300) of adjacent element processors (3) 
20 through one of said Individual data buses (7) for communication therebetween. 

2. The Information processing system (2) of Claim 1, wherein said associated data memory (8; 76; 300) of each 
respective element processor (3) comprises data memory (8) within said respective element processor (3). 

25 3. The information processing system (2) of Claim 1, wherein said associated data memory (8; 76; 300) of each 
respective element processor (3) comprises shared memory (76) arranged outside said respective element proc- 
essor (3) along a data path between said respective element processor (3) and an element processor (3) adjacent 
thereto. 

30 4. The information processing system (2) of Claim 1, wherein said associated data memory (8; 76; 300) of each 
respective element processor (3) holds duplicate data of selected adjacent element processors (3). 

5. The information processing system (2) of Claim 1, wherein a plural number of time steps corresponding to identical 
ones of said lattice points are furthermore assigned to said element processors (3). 

6. The information processing system (2) of Claim 6, wherein a plural number of two-dimensional planes (18), in 
which said element processors (3) performing the processing of said lattice points obtained by spatially dividing a 
two-dimensional field are two-dimensional ly arranged, are arranged along the time base to be constructed three- 
dimensionally, when a two-dimensional analytical solution is looked for, and said element processors (3) ere oon- 

*o struct ed In pipeline In the direction of time base. 

7. The Information processing system (2) of Claim 1 , further comprising transmission means (4) for transmitting initial 
conditions and boundary conditions to said element processors (3). 

^5 8. The Information processing system (2) of Claim 1 , wherein data communications between said adjacent element 
processors (3) are performed individually for each of said element processors (3), asynchronously and repeatedly. 

9. The information processing system (2) of Claim 1 , wherein data communications between said adjacent element 
processors (3) are message communications and performed with multiple instruction stream and multiple data 

50 stream. 

10. The Information processing system (2) of Claim 1 , wherein said element processors (3) can directly refer, respec- 
tively, to said associated data memory (8; 76; 300) of said adjacent element processors (3). 

55 11. The Information processing system (2) of Claim 1 0, wherein said element processors (3) can directly refer to and 
renew, respectively, said associated data memory (8; 76; 300) of said adjacent element processors (3). 

12. The Information processing system (2) of Claim 1 , wherein said associated data memories (6; 76; 300) Include, 
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on the routes (7; 302, 303) connecting said adjacent element processors (3), first memories (76; 300) shared by 
said adjacent element processors (3) and can directly refer to and renew the content stored In said first memories 
(76; 300) from a plural number of said element processors (3) connected to said first memories (76; 300). 

& 13. The Information processing system (2) of Claim 10, wherein said associated data memories (6) Include multlport 
memories (8; 76a). 

14. The Information processing system (2) of Claim 1, further comprising buffer means (60; 304) for temporarily buff- 
ering access demand to said associated data memories (8; 76; 300). 

10 

15. The Information processing system (2) of Claim 1, wherein 

said element processors (3) respectively have buffer memories (76a) to which data sent from said adjacent 
element processors (3) are written as data with object name In communications between said adjacent element 
io processors (3), and 

said element processors (3) complete reception of data from said adjacent element processors (3) by searching 
data with object name at optional timing from said buffer memories (76a). 

16. The information processing system (2) of Claim 1, wherein 

20 

said element processors (3) respectively have matching memories (83) for waiting for said data necessary for 
at least one of said computing operations, and 

said processing unit (6) In each of said element processors (3) performs at least one prescribed operation 
among said computing operations at the point in time sequence when the necessary data are available in said 
20 matching memory (83) in the same. 

1 7. The information processing system (2) of CI aim 1 , wherein said element processors (3) respectively rave program 
memories (11) available for rewriting the program stipulating the operations of said processing units (5). 

18. The information processing system (2) of Claim 1, wherein 

each of said processing units (6) works based on a data flow system which enables simultaneous execution 
of a plural number of said program assigned to different areas of each of said program memories (11), and 
each of said processing units (5) executes said computing operations of a plural number of said lattice points 
30 simultaneously by assigning a plural number of programs for processing said respective lattice points to dif- 

ferent areas of each of said program memories (11). 

19. The information processing system (2) of Claim 18, wherein said data flow system Includes a dynamic data driven 
system capable of simultaneously executing one same program on input data of a plural number of sets with the 

40 use of identifiers. 

20. The information processing system (2) of Claim 1, wherein said processing units (5) respectively have hardware 
circuits (13) In which arithmetic-logic elements (14) are wired In the shape based on a data flow graph according 
to the wi red-logic for part or whole of the data processing flow. 

40 

21. The information processing system (2) of Claim 1, wherein 

said processing units (5) respectively have hardware circuits (13) according to one same wired-) ogle, 

said hardware circuits (1 3) load In advance said associated data memories (8) of said element processors (3) 

60 with internal point bits indicating that said lattice points are positioned inside a computing area in said field or 

outside said object in said computing area with respect to all of said lattice points, with said external point bits 
Indicating that said lattice points are positioned outside said computing area or Inside an object In said com- 
puting area, and with boundary condition bits indicating information of boundary conditions with respect to 
said lattice points positioned on the boundary of said computing area, and 

00 said hardware circuits (13) change the processing of the data flow graph according to said internal point bits, 

said externa) point bits and said boundary condition bits. 

22. The Information processing system (2) of Claim 20, wherein said processing units (5) further respectively have 
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arithmetic devices (10) for directly executing operation codes for cumulatively adding the respective results of 
multiplication of first Input by second input, third input by fourth input, fifth Input by sixth input, .... (2 x N - 1)-th 
input by (2 x N)-th input to 2N pieces of input data. 

0 23. The information processing system (2) of Claim 1, further comprising fine node couplings (7a) connecting said 
adjacent element processors (3) to each other and coarse node couplings (7b) connecting said element processors 
(3) to each other by skipping no less than one said element processors (3) between them. 

24. The information processing system (2) of Claim 1, wherein said element processors (3) respectively have output 
10 units (91 ) for asynchronously reading output data, furnishing that output data with identifiers necessary for arrang- 
ing and sorting of data and sending data to outside said element processors (3). 

25. The Information processing system (2) of Claim 24, further comprising output data image processing unit (97) for 
collecting output data from said output units (91) of said element processors (3) and generating visible images of 

18 said output data In real time based on the output data arriving asynchronously from all of said element processors 

(3). 

26. The information processing system (2) of Claim 1 , further comprising coordinate conversion unit (120) for convert- 
ing a first coordinate system used by a host computer (1) controlling said element processors (3) to a second 

20 coordinate system used by said element processors (3) when date are transferred from said host computer (1 ) to 
said element processors (3), and converting said second coordinate system used by said element processors (3) 
to said first coordinate system used by said host computer (1) when data are transferred from said element proc- 
essors (3) to said host computer (1). 

25 27. The information processing system (2) of Claim 1, wherein 

said plural number of element processors (3) integraDy controlled by a host com puter (1 ) respectively comprise 
first convergence judging units (183), provided in each of the element processors (3) and independently of 
said processing units (5), for Judging convergence without disturbing the computation of the processing units 
30 (5), and 

said information processing system (2) further comprising: 

first communication routes (181) for dividing a plural number of said element processors (3) into a plural 
number of groups and connecting said first convergence judging units (183) of said respective element 
3$ processors (3) in the form of a unidirectional closed loop or In a centralized form in said respective groups; 

second convergence Judging units (160), connected to said first communication routes (181) respectively, 
for integrating the results of Judgment of convergence of all said first convergence Judging units (183) In 
said groups; and 

second communication routes (182) for connecting all of the second convergence Judging units (180) of 
4> said groups In the form of a unidirectional closed loop or In a centralized form. 

28. The information processing system (2) of Claim 1, further comprising: 

at least one route (181) for transferring messages of a plurality of said element processors (3); 

first convergence Judging means (183) for judging convergences on said element processors (3) or no less 

than one processing means (5) assigned to each of said element processors (3); 

first message sending means (183) for sending a first message indicating the convergence on a prescribed 
element processor (3) or the processing means (5) of said prescribed element processor (3) when said first 
convergence judging means (183) judges the convergence on said prescribed element processor (3) or said 

so processing means (5) of the same; 

second message sending means for sending a second message indicating the convergence on the element 
processor (3) or the processing means (5) receiving said first or second message from the adjacent element 
processor (3) or the adjacent element processor (3) after said element processor (3) orsald processing means 
(5) receiving said first or second message; and 

35 second convergence Judging means for judging the convergence of all said element processors (3) by trans- 

ferring said second message one after another through said at least one route by raid second sending means. 

29. The Information processing system (2) of Claim 28, wherein 
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said at least one route includes a plurality of different routes connecting said processing means (5), and 
said second convergence Judging means confirms said convergence of all said element processors (3) in 
multiplex ways by the messages transferred one after another through said plurality of different routes. 

0 30. A method of computation performed by using an information processing system (2), wherein 

said information processing system (2) comprises: 

a plural number of element processors (3) respectively having processing units (5) for performing com- 
"> puting operations and associated data memories (8; 76; 300) for storing data regarding said computing 

operations for performing computation by assigning, regarding at least one governing equation of a field 
which can be described in at least one prescribed equation by using said plural number of element proc- 
essors (3), a plural number of lattice points obtained by spatlaOy dividing said field to said element proc- 
essors (3); and 

w a plural number of individual data buses (7) for having mutual communications of said data between 

adjacent ones of said element processors (3), wherein 

said computation is performed by operating said element processors (3) asynchronously and Independ- 
ently by using said data transmitted through said data buses (7); 

said method of computation being characterized In that each of said element processors (3) directly 
20 writes to or reads from said associated data memories (8; 76; 300) of adjacent element processors (3) 

through one of said Individual data buses (7) for communication therebetween. 

31. The method of computation performed by using an Information processing system (2) of Claim 30, wherein said 
associated data memory (8; 76; 300) of each respective element processor (3) comprises data memory (8) within 

20 said respective element processor (3). 

32. The method of computation performed by using an information processing system (2) of Claim 1, wherein said 
associated data memory (6; 76; 300) of each respective element processor (3) comprises shared memory (76) 
arranged outside said respective element processor (3) along a data path between said respective element proo- 

30 essor (3) and an element processor (3) adjacent thereto. 

33. The method of compulation performed by using an information processing system (2) of Claim 30, wherein dupli- 
cate data of selected adjacent element processors (3) is written to the associated data memory (8; 76; 300) of 
each respective element processor (3). 

35 

34. The method of computation performed by using an information processing system (2) of Claim 30, wherein a plural 
number of time steps corresponding to identical ones of said lattice points are furthermore assigned to said element 
processors (3). 

38. The method of computation performed by using an Information processing system (2) of Claim 34, wherein a plural 
number of two-dimensional planes (18), in which said element processors (3) performing the processing of said 
lattice points obtained by spatially dividing a two-dimensional field are two-dimensionelly arranged, are arranged 
along the time base to be constructed three-dimenslonally, when a two-dimensional analytical solution is Looked 
for, and said element processors (3) are constructed In pipeline in the direction of time base. 

45 

38. The method of computation performed by using an Information processing system (2) of Claim 30, wherein 

said at least one prescribed equation includes at least one differential equation, and 
said at least one prescribed equation obtains the solution to said at least one differential equation by operating 
80 said element processors (3) asynchronously and independently by finite difference method. 

37. The method of computation performed by using en Information processing system (2) of Claim 30, wherein said 
at least one prescribed differential equation includes at least one partial differential equation or at least one ordinary 
differential equation. 

65 

38. The method of computation performed by using an information processing system (2) of Claim 30, wherein 

said element processors (3) take charge of computation of a plural number of lattice points obtained by spatially 
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dividing said field respectively, and 

the number corresponding to said lattice points or the base address of the data area corresponding to said 
lattice points on said associated data memories (8; 76; 300) are used as input tokens to control said compu- 
tation performed by said element processors (3). 

39. The method of computation performed by using an information processing system (2) of Claim 30, wherein said 
information processing system (2) performs said computation by using transmission means (4) for transmitting 
initial conditions and boundary conditions to said element processors (3). 

40. The method of computation performed by using an information processing system (2) of Claim 30, wherein data 
communications between said adjacent element processors (3) are performed individually for each of said element 
processors (3), asynchronously end repeatedly. 

41. The method of computation performed by using an Information processing system (2) of Claim 30, wherein data 
communications between said adjacent element processors (3) are message communications and performed with 
multiple instruction stream and multiple data stream. 

42. The method of computation performed by using an information processing system (2) of Claim 30, wherein said 
element processors (3) directly refer, respectively, to said associated data memories (6; 76; 300) of said adjacent 
element processors (3). 

43. The method of computation performed by using an Information processing system (2) of Claim 36, wherein said 
element processors (3) directly refer to and renew, respectively, said associated data memories (8; 76; 300) of 
said adjacent element processors (3). 

44. The method of computation performed by using an information processing system (2) of Claim 30, wherein said 
associated data memories (6; 76; 300) include, on the routes (7; 302, 303) connecting said adjacent element 
processors (3), first memories (76; 300) shared by said adjacent element processors (3) and directly refer to and 
renew the content stored In said first memories (76; 300) from a plural number of said element processors (3) 
connected to said first memories (76; 300). 

46. The method of computation performed by using an information processing system (2) of Claim 42, wherein said 
associated data memories (8) include multipart memories (8; 76a). 

46. The method of computation performed by using an information processing system (2) of Claim 30, wherein said 
information processing system (2) performs said computation by using buffer means (60; 304) for temporarily 
buffering access demand to said associated data memories (6; 76; 300). 

47. The method of computation performed by using an information processing system (2) of Claim 30, wherein 

said element processors (3) respectively comprise buffermemories (76a) to which data sent from said adjacent 
element processors (3) ere written as data with object name in communications between said adjacent element 
processors (3), and 

said element processors (3) complete reception of data from said adjacent element processors (3) by searching 
data with object name at optional timing from said buffer memories (76a). 

48. The method of computation performed by using an information processing system (2) of Claim 30, wherein 

said element processors (3) respectively comprise matching memories (83) for waiting for said data necessary 
for at least one of said computing operations, and 

said processing unit (6) in each of said element processors (3) performs at least one prescribed operation 
among said computing operations at the point in time when the necessary data are available In said matching 
memory (63) in the same. 

49. The method of computation performed by using an information processing system (2) of Claim 30, wherein said 
element processors (3) respectively comprise program memories (11) available for rewriting the program stipulating 
said computing operations of said processing units (5). 
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50. The method of computation performed by using an information processing system (2) of Claim 30, wherein 

said processing units (5) perform said computing operations by using one same computation program, 
said computation program loads in advance said associated data memories (8) of said element processors 

0 (3) with internal point bits Indicating that said lattice points are positioned inside a computing area in said field 

or outside said object in said computing area with respect to all of said lattice points, with said external point 
bits Indicating that said lattice points are positioned outside said computing area or inside an object in said 
computing area, and with boundary condition bits indicating information of boundary conditions with respect 
to said lattice points positioned on the boundary of said computing area, and 

"> said computation program changes the processing on said element processors (3) according to said internal 

point bits, said external point bits and said boundary condition bits. 

51. The method of computation performed by using an information processing system (2) of Claim 30, wherein 

*5 each of said processing units (5) works based on a data fiow system which enables simultaneous execution 

of a plural number of said program assigned to different areas of each of said program memories (11), end 
each of said processing units (5) executes said computing operations of a plural number of said lattice points 
simultaneously by assigning a plural number of programs for processing said respective lattice points to dif- 
ferent areas of said program memories. 

2D 

52. The method of computation performed by using an information processing system (2) of Claim 51, wherein said 
data fiow system includes a dynamic data driven system capable of simultaneously executing one same program 
on input data of a plural number of sets with the use of identifiers. 

25 53. The method of computation performed by using an information processing system (2) of Claim 52, wherein said 
processing units (5) calculate the address of said associated data memories (8) which store data regarding said 
adjacent element processors (3) and make reference to and renewal of said associated data memories (8) by 
using the address of said associated data memories (8), based on the value of said identifiers assigned to said 
input data. 

30 

54. The method of computation performed by using an Information processing system (2) of Claim 30, wherein said 
processing units (5) respectively have hardware circuits (13) in which arithmetic-logic elements (14) are wired in 
the shape based on a data flow graph according to the wi red-logic for part or whole of the data processing flow. 

35 55. The method of computation performed by using an Information processing system (2) of Claim 30, wherein 

said processing units (5) respectively have hardware circuits (13) according to one same wired-! ogle, 
said hardware circuits load In advance said associated data memories (8) of said element processors (3) with 
Internal point bits Indicating that lattice points are positioned inside a computing area in said field or outside 
40 said object In said computing area with respect to all of said lattice points, with said external point bits Indicating 

that said lattice points are positioned outside said computing area or inside an object in said computing area, 
and wfth boundary condition bits indicating Information of boundary conditions with respect to said lattice points 
positioned on the boundary of raid computing area, and 

said hardware circuits (13) change the processing of the data flow graph according to said internal point bits, 
45 said external point bits and said boundary condition bits. 

58. The method of computation performed by using an information processing system (2) of Claim 54, wherein said 
processing units (5) respectively have arithmetic devices (10) for directly executing operation codes for cumula- 
tively adding the respective results of multiplication of first Input by second input, third Input by fourth Input, fifth 
go input by sixth input (2x N - 1)-th input by (2 x N)4h input to 2N pieces of input data. 

57. The method of computation performed by using an Information processing system (2) of Claim 54, wherein said 
information processing system (2) levels the number of times of communications at all of said lattice points by 
performing dummy communications even for said lattice points with small number of times of communications for 

55 solving said at least one governing equation of said field because said lattice points forming the subject of com- 

putation are found on the border of areas. 

58. The method of computation performed by using an Information processing system (2) of Claim 30, wherein said 
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information processing system (2) levels the number of times of operations at all of said lattice points by performing 
dummy operations even for said lattice points with small number of times of operations for solving said at least 
one governing equation of said field because said lattice points forming the subject of computation ere found on 
the border of areas. 

e 

59. The method of computation performed by using an information processing system (2) of Claim 30, wherein said 
information processing system (2) which performs processing of each of said lattice points for finding solution to 
at least one differential equation by finite difference method and performs communications between the respective 
lattice points, information indicating the necessity or not of communication and the method of communication for 

10 each of said lattice points. 

60. The method of computation performed by using an information processing system (2) of Claim 30, wherein said 
Information processing system (2) which performs processing of each of said lattice points for finding solution to 
at least one differential equation by finite difference method assigns a memory area to each of said lattice points, 

15 performs communications between information of adjacent lattice points by raid memory area and provides each 

of said lattice points with information for indicating said memory area to be used for communication in each direction 
where communication should be made. 

61. The method of computation performed by using an Information processing system (2) of Claim 60, wherein said 
20 Information processing system (2) provides each of said lattice points with information indicating a common dummy 

area in the direction where communication is impossible. 

62. The method of computation performed by using an Information processing system (2) of Claim 30, wherein when 
said adjacent lattice points are assigned over different said element processors (3), said element processors (3) 

26 on both sides hold two pairs of data in time for intercommunication and, when the data for those lattice points has 

been renewed, said processors (3) do not rewrite the data if any displacement of the renewed data is no larger 
than the prescribed value in comparison with the data of the chronologically previous pair. 

63. The method of computation performed by using an Information processing system (2) of Claim 60, wherein said 
30 information processing system (2) assigns, to said associated data memory (8), virtual data area simulating lattice 

points which are closest to the border of the computation area forming the subject of the calculation of said at least 
one prescribed equation and found outside said computation area. 

64. The method of computation performed by using an information processing system (2) of Claim 30, wherein said 
36 information processing system (2) comprises fine node coupling (7a) connecting mutually adjacent said element 

processors (3) to each other and coarse node coupling (7b) connecting said element processors (3) to each other 
by skipping no less than one said element processors (3) between them. 

65. The method of computation performed by using an information processing system (2) of Claim 64, wherein said 
40 Information processing system (2) advances computation of said at least one prescribed equation by alternately 

performing computation by said element processors (3) connected by said coarse node coupling (7b) and com- 
putation by said element processors (3) connected by said fine node coupling (7a). 

66. The method of computation performed by using an Information processing system (2) of Claim 30, wherein said 
45 information processing system (2) performs calculation of said at least one governing equation of said field using 

only part of said lattice points selected, in parallel with the calculation of said at least one governing equation of 
said field using all of said lattice points. 

67. The method of computation performed by using an Information processing system (2) of Claim 30, wherein the 
so algorithm of the calculation of said at least one governing equation of said field using only selected ones of said 

lattice points is different from the algorithm used for the calculation of said at least one prescribed governing 
equation of said field using all of said lattice points. 

6a The method of computation performed by using an Information processing system (2) of Claim 67, wherein said 
*ff information processing system (2) has no less than two pairs of data regarding mutually adjacent said lattice points 
in time sequence and stops the operation of said lattice points if the amount of fluctuation between said pairs of 
data is no larger than the prescribed amount. 
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69. The method of computation performed by using an information processing system (2) of Claim 68, wherein the 
value of said prescribed amount for omitting the operation of said lattice points depending on the value of fluctuation 
between said pairs of data is reduced according to the grade of convergence. 

70. The method of computation performed by using an information processing system (2) of Claim 30, wherein said 
element processors (3) respectively have output units (91) for asynchronously reading output data, furnishing that 
output data with identifiers necessary for arranging and sorting of data and sending data to outside said element 
processors (3). 

71. The method of computation performed by using an information processing system (2) of Claim 70, wherein said 
information processing system (2) comprises output data image processing unit (97) for collecting output data from 
said output units (91) of said element processors (3) and generating visible images of said output data in real time 
based on the output data arriving asynchronously from all said element processors (3). 

72. The method of computation performed by using an information processing system (2) of Claim 30, wherein said 
information processing system (2) comprises coordinate conversion unit (120) for converting an first coordinate 
system used by a host computer (1) controlling said element processors (3) to a second coordinate system used 
by said element processors (3) when data are transferred from said host computer (1) to said element processors 
(3), and converting said second coordinate system used by said element processors (3) to said first coordinate 
system used by said host computer (1) when data are transferred from said element processors (3) to said host 
computer (1). 

73. The method of computation performed by using an information processing system (2) of Claim 30, wherein 

said plural number of element processors (3) area plural number of element processors (3) integrally controlled 
by a host computer (1), 

said plural number of element processors (3) comprise convergence, Judging units (1 83), provided in each of 
the element processors (3) and Independently of said processing units (5) respectively, for Judging conver- 
gence without disturbing the computation of the processing unit (5), and 
said Information processing system (2) comprises 

first communication routes (181) for dividing a plural number of said element processors (3) into a plural 
number of groups end connecting said convergence judging units (183) of said respective element proc- 
essors (3) in the form of a unidirectional closed loop or in a centralized form in said respective groups, and 
second communication routes (182) connected to each of said first communication routes (181) of said 
groups, and connecting ail of the convergence judging units (183) integrating the results of judgement of 
convergence of all said convergence Judging units (183) in said groups and all of said convergence judging 
units (183) of said groups In the form of a unidirectional closed loop or In a centralized form. 

74. The method of computation performed by using an information processing system (2) of Claim 30, wherein 

said Information processing system (2) comprises at least one route for transferring messages of a plurality 

of said element processors (3), end 

said method of computation further comprising steps of 

(a) Judging convergences on each of said element processors (3) or no less than one processing means 
(5) assigned to each of said element processors (3); 

(b) sending a first message Indicating a prescribed element processor (3) or a processing means (5) of 
said prescribed element (3), when said Information processing system (2) Judges the convergence on said 
prescribed element processor (3) or said processing means (5) of said prescribed element (3); 

(c) sending a second message indicating the convergence on the element processor (3) or the processing 
means (5) receiving said first or second message from the adjacent element processor (3) or the adjacent 
element processor (3) after said element processor (3) or said processing means (5) receiving said first 
or second message; and 

(d) Judging the convergence of all said element processors (3) by transferring said second message one 
after another by repeating said step (c). 

75. The method of computation performed by using an information processing system (2) of Claim 74, wherein said 
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Information processing system (2) repeats said step (a) to step (d) for a plurality of confirmations of said conver- 
gence of all said element processors (3). 

76. The method of computation performed by using an Information processing system (2) of Claim 75, wherein 

6 

said information processing system (2) comprises second convergence judging means, 
said at least one route includes a plurality of different routes connecting said processing means (5) f and 
said second convergence judging means confirms said convergence of all said element processors (3) in 
multiplex ways by the messages transferred one after another through said plurality of different routes. 

10 

77. The method of computation performed by using an information processing system (2) of Claim 76, wherein said 
information processing system (2) forms judgement of convergence again by stopping the processing of all said 
element processors (3) or said processing means (5), after the completion of general convergence has been Judged 
by said processes which Judge the general convergence. 

15 

78. The method of computation performed by using an information processing system (2) having a plural number of 
element processors (3) of Claim 30, farther comprising steps of: 

initializing storing means for storing the results of judgement of convergence; and 
*> Judging convergence or not of the computation for said lattice points for the respective lattice points taken 

charge by said element processors (3) and, if it is judged that convergence has not yet been reached, sets 
said storing means to the prescribed logical value. 

79. The information processing system (2) of Claim 1, wherein 

25 

said plural number of element processors (3) share said associated data memory (300) between said adjacent 
element processors (3), and 

each of said plural number of element processors (3) comprises Input/output control unit (301) commonly 
connected to a plural number of data memories (300) shared with said adjacent element processors (3) for 
30 controlling Input/output of data with one of the plural number of data memories (300) by switching the direction 

of communication according to demand for access. 

60. The information processing system (2) of Claim 1, further comprising 

36 a plural number of registers (305) provided in correspondence to said plural number of element processors 

(3) for temporarily storing the results of judgement of a convergence of an operation executed by each of said 
plural number of element processors (3); and 

e wired OR circuit for generating logical product of the output of a plural number of said registers (305). 

4> 81 . The method of computation performed by using an Information processing system (2) of Claim 30, wherein 

a plural number of said element processors (3) share said associated data memory (300) between said adja- 
cent element processors (3), 

each of said plural number of element processors (3) comprises input/output control unit (301) commonly 
46 connected to a plural number of data memories (300) shared with said adjacent element processors (3) for 

controlling Input/output of data with one of the plural number of data memories (300) by switching the direction 
of communication according to demand for access, and 

each of said plural number of element processors (3) refers to the content of said data memory through said 
input/output control unit (301). 

60 

82. The method of computation performed by using an information processing system (2) of Claim 30, wherein said 
Information processing system (2) temporarily stores, In said plural number of registers (305) provided in corre- 
spondence to said plural number of element processors (3), the results of judgement of a convergence of the 
corresponding element processors (3), and executes operations while taking the logical product of said results of 

6* Judgement of said convergence stored in a plural number of said registers (305) with the wired OR circuit. 

83. The method of computation performed by using an information processing system (2) of Claim 30, wherein the 
number of lattice points per one element processor (3) Including unstable lattice points for analysis In which un- 



71 



EP 0 696 001 B1 



stable factors exist Is smaller than the number of lattice points per one element processor (3) not including any 
unstable lattice points for analysis. 



PatantansprQche 

1. Informatlonen verarbeitendes System (2), mlt: 

einer Mehrzahl von Elementarprozessoren (3) Jewells mit Verarbeitungseinrterten (5) zum Durchfuhren von 
Rechenoperetionen und zugehdrigen Datenspef chem (8; 76; 300) zum Speichem von die Rechenoperationen 
betreffenden Daten, zum Durchfuhren von Berechnungen durch Zuordnen einer Mehrzahl von durch rfiumliche 
TeDung elnes Feldes erhattenen Gitterpunkten zu den Elementarprozessoren (3) f unter Beachtung mindestens 
einer bestimmenden Gleichung des Feldes, das In mindestens einer vorgeschriebenen Glelchung unter Ver- 
wendung der Mehrzahl von Elementarprozessoren (3) beschrleben werden kann; und 
einer Mehrzahl von individuellen Oatenbussen (7) v urn eine gegensertige Kommunikatton der Daten zwischen 
benachbarten Elementarprozessoren (3) zu heben, wobei 

die Berechnung durch asynchrones und unabhangiges Betreiben der Elementarprozessoren (3) unter 
Verwendung der durch die Datenbusse (7) Qbertragenen Daten durchgefQhrt wind; 
das Informatlonen verarbeitende System dadurch gekennzelchnet 1st. da A jeder der Elementarprozes- 
soren (3) direkt auf die zugehdrigen Datenspeicher (8; 76; 300) benachbarter Elementarprozessoren (3) 
durch elnen der Individuellen Datenbusse (7)zur Kommunlkatlon dazwlschen schreibt Oder von ihnen liest. 

2. Informatlonen verarbeitendes System (2) nach Anspruch 1 , dadurch gekennzelchnet, daft 

der zugeharfge Datenspeicher (8; 76; 300) jedes JeweiOgen Elementarprazessors (3) Datenspeicher (8) in- 
ner halb des Jeweillgen Elementarprozessore (3) aufweist 

3. Informatlonen verarbeitendes System (2) nach Anspruch 1 , dadurch gekennzelchnet, daft 

der zugehdrige Datenspeicher (8; 76; 300) jedes Jeweillgen Etemerrtarprozessora (3) elnen au&erhalb des 
Jeweillgen Elementarprozessore (3) entlang einem Datenpfad zwischen dem jeweillgen Elementarprozessor 
(3) und einem dlesem benachbarten Elementarprozessor (3) angeordneten gemeinsamen Speicher (76) auf- 
weist 

4. Informatlonen verarbeitendes System (2) nach Anspruch 1 , dadurch gekennzelchnet daft 

der zugehdrige Datenspeicher (8; 76; 300) Jedes Jeweillgen Efementarprozessors (3) duplbderte Daten aus- 
gewfihiter benachbarter Elementarprozessoren (3) hall 

6. Informatlonen verarbeitendes System (2) nach Anspruch 1 , dadurch gekennzelchnet, daft 

weiterhln eine Mehrzahl von Zertschritten, die identischen Gitterpunkten entsprechen, den Elementarprozes- 
soren (3) zugeordnet Bind. 

6. Informatlonen verarbeitendes System (2) nach Anspruch 6, dadurch gekennzelchnet daft 

eine Mehrzahl von zweidimensionalen Ebenen (18), in denen die die Bearbertung der durch raumliches Teilen 
elnes zweidimensionalen Feldes erhattenen Gitterpunkte durchfUhrenden Elementarprozessoren (3) zweidi- 
mensional angeordnet slnd, entlang der Zertbasis, die drekfimenslonal aufgebaut werden soil, wenn eine zwei- 
dimensionale Ldsung gesucht wind, angeordnet 1st, und die Elementarprozessoren (3) in Pipeline-Form in der 
Rlchtung der Zertbasis aufgebaut Bind. 

7. Informatlonen verarbeitendes System (2) nach Anspruch 1 , wetter gekennzelchnet durch 

Obertregungselnrichtungen (4) zum Obertragen von Anfangsbedingungen und Randbedingungen zu den Ela- 
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8. Informatlonen verarbeitendes System (2) nach Anspruch 1 , dadurch gekennzeichnet, daft 

DateriausteuBchvorgfinge zwlschen den benachbarten Elementarprozessoren (3) individuell fur jeden der Ete- 
mentarprozessoren (3) aayrtchron und mehrfach durchgefQhrt werden. 

0. Informatlonen verarbeitendes System (2) nach Anspnjch 1 , dadurch gekennzeichnet. daft 

Datenaustauschvorgange zwlschen den benachbarten Element arprozessoren (3) Nachrichtenaustauschvor- 
gange sind und mrt mehrlachem Befehlsstrom und mehrfachem Datenstrom durchgefQhrt werden. 

10. Informatlonen verarbeitendes System (2) nach Anspruch 1 , dadurch gekennzeichnet, daft 

die Elementarprozessoren (3) den Jewells zugehdrigen Datenspeicher (8; 76; 300) der benachbarten Elemerv 
tarprozessoren (3) direkt referenzieren konnen. 

11. Informatlonen verarbeitendes System (2) nach Anspruch 10, dadurch gekennzeichnet, daft 

die Elementarprozessoren (3) den zugehdrigen Datenspeicher (8; 76; 300) der benachbarten Elementarpro- 
zessoran (3) direkt referenzieren bzw. emeuern kdnnen. 

12. Informatlonen verarbeitendes System (2) nach Anspruch 1 , dadurch gekennzeichnet, daft 

die zugehdrigen Datenspeicher (8; 76; 300) auf den die benachbarten Elementarprozessoren (3) verbinden- 
den LeKungen (7; 302, 303) erste Speicher (76; 300), die von den benachbarten Elementarprozessoren (3) 
geteilt werden, enthatten und den In den ersten Speichem (76; 300) von einer Mehrzahl der mrt den ersten 
Speichem (76; 300) verbundenen Elementarprozessoren (3) gespeicherten InhaJt direkt referenzieren bzw. 
emeuern kdnnen. 

13. Informatlonen verarbeitendes System (2) nach Anspruch 10, dadurch gekennzeichnet, daft 

die zugehdrigen Datenspeicher (6) MuiUport-Speicher (8; 76a) enthalten. 

14. Informatlonen verarbeitendes System (2) nach Anspruch 1 , welter gekennzeichnet durch 

Puffereinrichtungen (80; 304) zum vorubergehenden Puffem von Zugriffeanfragen auf die zugehdrigen Da- 
tenspeicher (6; 76; 300). 

15. Informatlonen verarbeitendes System (2) nach Anspruch 1 , dadurch gekennzeichnet, daft 

die Elementarprozessoren (3) Jewells Puffers pelcher (76a) haben, in welche von den benachbarten Elemen- 
tarprozessoren (3) gesendete Daten als Daten mit Objektname in Austauschvorgangen zwlschen den be- 
nachbarten Elementarprozessoren (3) geschrieben werden, und 

die Eiementarprozessoren (3) den Empfang von Daten von den benachbarten Elementarprozessoren (3) 
durch Suchen nach Daten mit Objektname in wahlbarer Zeitabstimmung von den Pufferepeichem (76a) ab- 
schlieften. 

16. Informatlonen verarbeitendes System (2) nach Anspruch 1 , dadurch gekennzeichnet, daft 

die Elementarprozessoren (3) Jewells passende Speicher (83) zum Warten auf die fQr mlndestens dnen der 
Rechenoperattonen notwendigen Daten haben, und 

die Verarbertungseinheit (5) in jedem der Elementarprozessoren (3) mlndestens eine vorgeschriebene Ope- 
ration unter den Rechenoperationen zu dem Punkt im Zeitablauf durchfuhrt, en dem die notwendigen Daten 
In dem passenden Speicher (63) in demselben verfugbar sind. 

17. Informatlonen verarbeitendes System (2) nach Anspruch 1 , dadurch gekennzeichnet, daft 

die Elementarprozessoren (3) Jewells zum Wiedereinschreiben des die Operationen der Verarbeitungseinhei- 
ten (6) festsetzenden Programms verfOgbare Programmspeicher (11) haben. 
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18. Informational! verarbeitendes System (2) nech Anspruch 1 , dadurch gekennzeichnet. daS 

jade dar Bearbeitungseinheiten (5) basierend auf elnem Datenflu&system arbeitet, das simultane Ausfuhrung 
elner Mehrzahl der verschiedenen Berelchen Jedes Programmspeichera (11) zugeordneten Programmen er- 
mogDcht, und 

jade dar Bearbattungseinhatten (5) die Rechanoperationan einar Mehrzahl der Gitterpunkte simultan durch 
Zuordnen einar Mehrzahl von Programmen zum Bearbeiten dar jeweiligen Gittarpunkta zu verschiedenen 
Bereichen jades Programmspeichers (11) ausfuhrt. 

19. Informatlonen verarbeitendes System (2) nach Anspruch 18, dadurch gekennzeichnet, daft 

das Datenflu&system ein dynamisches datengesteuertes System enthalt, das zum simultanen Ausfuhren ei- 
nes selben Programms mil Eingangsdaten einer Mehrzahl von Satzen unrter Verwertdung von Kennzeichnem 
lahlg 1st. 

20. Informatlonen verarbeitendes System (2) nach Anspruch 1 , dadurch gekennzetchnet, daft 

die Verarbemmgseinheiten (5) jeweils fesrverdrahtete Schaltungen (13) haben, in denen arithmetisch-togische 
Eiemente (14) in der auf einem Datenfluttgraph entspechend der verdrahteten Logik fur den teilweisen odar 
ganzen Datenverarbettungsfluft baslarenden Gestalt verdrahtet sind. 

21. Informatlonen verarbeitendes System (2) nach Anspruch 1 , dadurch gekennzeichnet, daft 

die Verarbeitungseinheiten (5) Jewells festverdrahtete Schaltungen (13) nach ein und der selben verdrahteten 
Logik haben, 

die festverdrahteten Schaltungen (13) die zugehdrfgen Datenspei crier (8) der Elementarprozessoren (3) mit 
tntemen Punktbits, die anzeigen, daft die Gitterpunkte innerhalb eines Rechenberelchs in dem Feld oder au- 
fterhalb des Objekts in dem Rechenbereich bezQglich aller Gitterpunkte angeordnet sind, mit den externen 
Punktblts, die anzeigen, daft die Gitterpunkte eufterhalb des Rechenberelchs oder innerhalb eines Objekts in 
dem Rechenbereich angeordnet sind, und mit Randbedingungsbits, die Information en uber Randbedingungen 
bezQgDch der am Rand des Rechenberelchs angeordneten Gitterpunkte anzeigen, im voraus laden, und 
die festverdrahtete Schaltungen (13) die Verarbeitung des Datenrftuftgraphs gemaft den intemen Punktbits, 
den externen Punktbits und den Randbedingungsbits findem. 

22. Informatlonen verarbeitendes System (2) nach Anspruch 20, dadurch gekannzeJchnet, daft 

die Verarbeltungselnheiten (5) welterhln jeweils artthmetlsche Vorrichtungen (1 0) zum dlrekten AusfQhren von 
Operation scodes zum kumulativen Addieren der jeweiligen Ergebnisse von Muitiplikation von erstem Ein gang 
mit zweltem Eingang, drittem - Elngang mit viertem Eingang, funftem Eingang mit sechstem Eingang, (2 
x N - 1)-tern Eingang mit (2 x N)-tem Eingang zu 2N Eingangsdaten haben. 

23. Informatlonen verarbeitendes System (2) nach Anspruch 1 , waiter gekennzeichnet durch 

Felnknotankopplungen (7a), die die banachbarten Elementarprozessoren (3) mlteinander verbinden, und 
Grobknotenkopplungen (7b) , die die Elementarprozessoren (3) mlteinander durch Uberspringen mindestens 
eines der Elementarprozessoren (3) dazwtechen verbinden. 

24. Information en verarbeitendes System (2) nach Anspruch 1 , dadurch gekennzeichnet, daft 

die Elementarprozessoren (3) jeweils Ausgabeeinheiten (91) zum asynchronen Lesen von Ausgabedaten, 
Ausstatten dleser Ausgabedaten mit zum Anordnen und Sortieren von Oaten notwendrgen Kennzeichnem 
und Senden von Daten nach aufterhalb der Elementarprozessoren (3) haben. 

25. informatlonen verarbeitendes System (2) nach Anspruch 24, wetter gekennzeichnet durch 

eine Ausgabedaten-Bildverarbeitungseinheit (97) zum Sammeln von Ausgangsdaten von den Ausgabeein- 
heiten (91) der Elementarprozessoren (3) und Erzeugen von sichtbaren BDdem der Ausgangsdaten in Echt- 
zelt, basierend auf den asynchron von alien Elementarprozessoren (3) elntreftenden Ausgangsdaten. 
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26. Information en verarbeitendes System (2) nech Anspruch 1 f welter gekennzeichnet durch 

eine KoordlnaterumwandJungseinheit (120) zum Umwandeln eines ersten, von dem die Elemerrtarprozesso- 
ren (3) steuemden Hauplrechner (1) verwendeten Koordlnatensystems in ein zweites, von den Elementar- 
prozessoren (3) verwendetes Koordinatensystern, wenn Daten vom Hauptrechner (1) zu den Elementarpro- 
zessoren (3) Qbertragen warden, und Umwandeln das zweiten, von den Elementarprozessoren (3) verwendete 
Koordinetensystems in das erste, vom Hauptrechner (1) verwendeten Koordinetensystem, wenn Daten von 
den Elementarprozessoren (3) zum Hauptrechner (1) Qbertragen werden. 

27. informationen verarbeitendes System (2) nach Anspruch 1 , dadurch gekennzeichnet. daft 

die Mehrzahl von Elementarprozessoren (3), die integral von einem Hauptrechner (1) gesteuert werden, Je- 
wells erste In jedem der Elementarprozessoren (3) bereltgestellte und von den Verarbeitungseinheiten (5) 
unabhSnglge Konvergenzbeurteilungseinheiten (163) zur Beurteilung der Konvergenz ohne Stdnen der Be- 
rechnung der Verarbeitungseinheiten (5) aufweisen, und 
das Informationen verarbeitende System (2) welter gekennzeichnet 1st durch: 

erste Kommunikationswege (181) zum Teilen elner Mehrzahl der Elementarprozessoren (3) in eine Mehr- 
zahl von Gruppen und VerWnden derersten Konvergenzbeurteilungselnheilen (183) der jewefligen Ele- 
mentarprozessoren (3) In Form elner unidlrektional geschlosenen Schlerfe oder in elner zentralislerten 
Form in den jeweiligen Gruppen; 

zweile, Jewells mft den ersten Komrnunikatlonswegen (181) verbundene Konvergenzbeurteflungselnhel- 
ten (180) zum Integrteren der Ergebnisse der Konvergenzbeurtejlung aller ersten Kbnvergenzbeurtel- 
lungseinhelten (163) in den Gruppen; und 

zweite Kommunikationswege (182) zum Verbinden aller zweiter Konvergenzbeurteilungseinherten (160) 
der Gruppen in Form elner unidlrektional geschlosenen Schlelfe oder in elner zentralisierten Form. 

28. Informationen verarbeitendes System (2) nach Anspruch 1 , welter gekennzeichnet durch: 

mindsstens einen Wag (1 81 ) zum Obertragen von Nachrichten elner Mehrzahl der Elementarprozessoren (3); 
ein erste8 Konvergenzbeurtellungsmfttel (1 83) zum Beurtellen der Konvergenz auf den Elementarprozessoren 
(3) oder mlndestens einem jedem der Elementarprozessoren (3) zugeordneten Verarbeitungsmittel (6); 
ein erstes Nachrichtensendemittel (183) zum Senden elner ersten die Konvergenz auf einem vorgeschriebe- 
nen Elementarprozessor (3) oder dem Verarbeitungsmittel (5) des vorgeschriebenen Elementarprozessors 
(3) anzeigenden Nachricht, wertn das erste Konvergenzbeurteilungsmittel (183) die Konvergenz auf dem vor- 
geschrlebenen Elementarprozessor (3) oder des Verarbeftungsmittels (5) des selben feststellt; 
ein zweites Nachrichtensendemittel zum Senden elner zweiten, die Konvergenz auf dem die erste oder zwette 
Nachricht von dem benach barton Elementarprozessor (3) erhaltenden Elementarprozessor (3) oder Verar- 
beitungsmittel (5) oder auf dem benachbarten Elementarprozessor (3) nach dem die erste oder zweite Nach- 
richt erhaltenden Elementarprozessor (3) oder Verarbeitungsmittel (5) anzeigenden Nachricht; und 
ein zweites Konvergenzbeurteilungsmittel zum Beurtellen der Konvergenz aller Elementarprozessoren (3) 
durch Obertragen der zweiten Nachricht eine nach der anderen uber den mlndestens einen Wag durch das 
zweite Sendemittel. 

29. Informationen verarbeitendes System (2) nach Anspruch 28, dadurch gekennzeichnet, daft 

der mlndestens eine Weg eine Mehrzahl von verschiedenen die Verarbeitungsmittel (5) verbindenden Wegen 
einschlle&t, und 

das zweite KonvergenzbeurteOungsmlttel die Konvergenz aller Elementarprozessoren (3) auf mehrfache Wei- 
ss durch die eine nach der anderen uber die Mehrzahl von verschiedenen Wegen Qbertragenen Nachrichten 
bestatlgt 

30. Unter Verwendung eines Informationen verarbeitenden Systems (2) durchgefuhrtes Rechenverfahren, wobel 

das Informationen verarbeitende Systems (2) 

eine Mehrzahl von Elementarprozessoren (3). jewefls mft Verarbeitungseinheiten (5) zum Durchfuhren von 
Rechenoperationen und zugehd rigen Datenspeichem (8; 76; 300) zum Speichern von die Rechenoperationen 
betreflenden Daten zum DurchfQhren von Berechnungen durch Zuordnen elner Mehrzahl von durch riumliche 
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TeQ ung eines Feldes erhaftenen Gitterpunkten zu den Elementarprozessoren (3), urrter Beachtung mindestens 
einer bestimmenden Gleichung des Feldes, das in mindestens einer vorgeschriebenen Glelchung unter Ver- 
wendung der Mehrzahl von Elementarprozessoren (3) beschrieben warden kann; und 
eine Mehrzahl von indlvfduellen Datenbussen (7) autwetet, urn elne gegenseitlgen Kommunlkation der Daten 
zwtechen benachbarten Elementarprozessoren (3) zu haben, wobei 

die Berechnung durch asynch rones und unabhangiges Betreiben der Elementarprozessoren (3) unter 
Verwendung der durch die Datenbusse (7) Qbertragenen Daten durchgefuhrt wind; 
das InformatJonen verarbeitende System dadurch gekennzeichnet 1st daft jeder der Elementarprozes- 
soren (3) direkt auf die zugehdrigen Datenspeicher (8; 76; 300) benachbarter Elementarprozessoren (3) 
durch einen der Indivlduellen Datenbusse (7) zur Kommunlkation. dazwfschen schrelbt oder von ihnen 
lies! 

31. Unter Verwendung eines Intormatlonen veraroeltenden Systems (2) durchgetQhrtes Rechenverfahren nach An- 
spruch 30, dadurch gekennzeichnet, daft 

der zugehdrige Datenspeicher (B; 76; 300) jedes jeweiligen Elementarprozessors (3) Datenspeicher (6) in 
dem jeweiOgen Elementarprozessor (3) aufWeist 

32. Unter Verwendung eines Intormatlonen veraroeltenden Systems (2) durchgefOhrtes Rechenverfahren nach An- 
spruch 1, dadurch gekennzeichnet, daft 

der zugehdrige Datenspeicher (8; 76; 300) jedes jeweiligen Elementarprozessors (3) einen auGerhalb des 
jeweiligen Elementarprozessors (3) entlang einem Datenpfad zwischen dem jeweiligen Elementarprozessor 
(3) und einem diesem benachbarten Elementarprozessor (3) angeordneten gemeinsamen Speicher (76) euf- 
weist 

33. Unter Verwendung eines Intormatlonen veraroeltenden Systems (2) durchgefGhrtes Rechenverfahren nach An- 
spruch 30, dadurch gekennzeichnet, daft 

dupllzierte Daten ausgewShlter benachbarter Elementarprozessoren (3) auf die zugehdrigen Datenspeicher 
(8; 76; 300) jedes jeweiligen Elementarprozessors (3) geschrieben werden. 

34. Unter Verwendung eines Intormatlonen veraroeltenden Systems (2) durchgefGhrtes Rechenverfahren nach An- 
spruch 30, dadurch gekennzeichnet, daft 

weltertiln elne Mehrzahl von Zeitschrttten, die Identischen Gitterpunkten entsprechen, den Elementarprozes- 
soren (3) zugeordnet tet. 

36. Unter Verwendung eines Intormatlonen veraroeltenden Systems (2) durchgetQhrtes Rechenverfahren nach Art- 
spruch 34, dadurch gekennzeichnet, daft 

eine Mehrzahl von zweidimensionalen Ebenen (18), in denen die die Bearbertung der durch raumlfches Teilen 
eines zweidimensionalen Feldes erhaftenen Gitterpunkte durchfuhrertden Elementarprozessoren (3) zweidi- 
mensional angeordnet sind, entlang derZertbasis, diedrekfimensional aufgebaut werden soil, wenn elnezwei- 
dlmenslonale Ldsung gesucht wird, angeordnet 1st, und die Elementarprozessoren (3) Qberiappend In der 
RIchtung der Zertbasis autgebaut sind. 

30. Unter Verwendung eines Intormatlonen verarteltenden Systems (2) durchgefGhrtes Rechenverfahren nach An- 
spruch 30, dadurch gekennzeichnet, daft 

die mindestens eine vorgeschriebene Glelchung mindestens elne Differential I elchung einschlleftt, und 
die mindestens elne vorgeschriebene Glelchung die Ldsung der mindestens einen Differentf algleichung durch 
asynchrones und unabhangiges Betdtigen der Elementarprozessoren (3) mit der Methode der finiten DifTe- 
renzen erhfllt 

37. Unter Verwendung eines Intormatlonen veraroeltenden Systems (2) durchgefGhrtes Rechenverfahren nach An- 
spruch 30, dadurch gekennzeichnet daft 
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die mindestens eine vorgeschriebene Differentialgleichung mindestens elne partielle Differentialgleichung 
Oder mindestens elne gewdhnllche Differentialgleichung elnschlleBt 

38. Unter Verwendung elnes Intormationen verarbeitenden Systems (2) durchgefuhrtes Rechenverfahren nach An- 
spruch 30, dadureh gekennzalchnet, daft 

die Etementarpruzessoren (3) die Berechnung einer Mehrzahl von durch rfiumliche Teilung des Feldes erhal- 
tenen Gitterpunkten jeweils Obernehmen, und 

die den Gitterpunkten entsprechende Nummer Oder die den Gitterpunkten auf den zugehdrigen Datenspei- 
chem (8; 76; 300) entsprechende Basisadresse des Datenbereichs als Eingangstoken zur Steuerung der von 
den Elementarprozessoren (3) durchgefOhrfen Berechnung verwendst wird. 

39. Unter Verwendung eines Intormationen verarbeitenden Systems (2) durchgefOhrtes Rechenverfartren nach An- 
spruch 30, dadureh gekennzalchnet, daft 

das Informationen verarbeitende System (2) die Berechnung unter Verwendung von ObeHragungsvorrichtun- 
gen (4) zum Ubertragen von Anfangsbedingungen und Randbedingungen zu den Elementarprozessoren (3) 
durchfQhrt. 

40. Unter Verwendung eines Informationen verarbeitenden Systems (2) durchgefOhrtes Rechenverfahren nach An- 
spruch 30, dadureh gekennzeichnet, daE 

Datenaustauschvorgdngezwi8Chen den benachbarten Elementarprozessoren (3) individual fQr jeden der Ele- 
mentarprozessoren (3) asynchron und mehrfach durchgefOhrt warden. 

41. Unter Verwendung eines Informationen verarbeitenden Systems (2) durchgefOhrtes Rechenverfahren nach An- 
spruch 30, dadureh gekennzeichnet, daft 

Datenaustauschvorggngezwischen den benachbarten Elementarprozessoren (3) Nachrichtenaustauschvor- 
gdnge slnd und mft mehrfachem Befehlsstrom und mehrfachem Datenstrom durchgefOhrt warden. 

42. Unter Verwendung eines Informationen verarbeitenden Systems (2) durchgefOhrtes Rechenverfahren nach An- 
spruch 30, dadureh gekennzeichnet, daS 

die Elementarprozessoren (3) die jeweils zugehdrigen Datenspeicher (8; 76; 300) der benachbarten Elemen- 
tarprozessoren (3) direkt referenzieren. 

43. Unter Verwendung elnes Informationen verarbeitenden Systems (2) durchgefOhrtes Rechenverfahren nach An- 
spruch 36, dadureh gekennzeichnet, daft 

die Elementarprozessoren (3) den Jeweils zugehdrigen Datenspeicher (8; 76; 300) der benachbarten Elemen- 
tarprozessoren (3) direkt referenzieren und emeuem. 

44. Unter Verwendung eines Informationen verarbeitenden Systems (2) durchgefOhrtes Rechenverfahren nach An- 
spruch 30, dadureh gekennzeichnet, daft 

die zugehdrigen Datenspeicher (8; 76; 300) auf den die benachbarten Elementarprozessoren (3) verbinden- 
den Leitungen (7; 302, 303) erste Speicher (76; 300), die von den benachbarten Elementarprozessoren (3) 
geteilt warden, enthaiten und den In den ereten Speichem (76; 300) von einer Mehrzahl der mit den ersten 
Spelchem (76; 300) verbundenen Elementarprozessoren (3) gespeicherten Inhalt direkt referenzieren und 
emeuem. 

45. Unter Verwendung eines Informationen verarbeitenden Systems (2) durchgefOhrtes Rechenverfahren nach An- 
spruch 42, dadureh gekennzalchnet, daft 

die zugehdrigen Datenspeicher (8) Multiport-Speicher (8; 76a) enthaiten. 

46. Unter Verwendung eines Informationen verarbeitenden Systems (2) durchgefOhrtes Rechenverfahren nach An- 
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spruch 30, dadurch gekennzeichnet, daS 

das Informationen verarbeitende System (2) die Berechnung unter Verwendung von Puffereinrichtungen (80; 
304) zum vorQbergehenden Puftern von Zugrlffeanfragen auf die zugehflrtgen Datenspelcher (8; 76; 300) 
durchfOhrt. 

47. Unter Verwendung eines Informationen verarbeitenden Systems (2) durchgefuhrtes Recnenverfahren nach An- 
spruch 30, dadurch gekennzalchnet, dafi 

die Elementarprozessoren (3) jeweils Pufferspelcher (76a) umfassen, in welche von den benachbarten Ele- 
mentarprozessoren (3) gesendete Daten als Daten mft Objektname in AustauschvorgSngen zwischen den 
benachbarten Elementarprozessoren (3) geschrieben warden, und 

die Elementarprozessoren (3) den Empfang von Daten von den benachbarten Elementarprozessoren (3) 
durch Suchen nach Daten mrt Objektname in w&hlbarerZeitabstimmung von den Pufferepetahem (76a) ab- 
schlieften. 

48. Unter Verwendung eines Informationen verarbeitenden Systems (2) durchgefQhrtes Recnenverfahren nach An- 
spruch 30, dadurch gekennzalchnet, daft 

die Elementarprozessoren (3) Jeweils passende Speicher (63) zum Warten auf die fOr mlndestens einen der 
Rechenoperationen notwendigen Daten umfassen, und 

die Verarbettungseinhelt (6) in Jedem der Elementarprozessoren (3) mlndestens eine vorgeschrfebene Ope- 
ration unter den Rechenoperationen zu dem Punkt im ZeitaWauf durchfuhrt, an dem die notwendigen Daten 
in dem passenden Speicher (83) in demselben veriOgbar sind. 

49. Unter Verwendung eines Informationen verarbeitenden Systems (2) durchgefuhrtes Recnenverfahren nach An- 
spruch 30, dadurch gekennzalchnet,, daft 

die Elementarprozessoren (3) jeweils die zum Wlederelnschreiben des die Rechenoperationen der Verarbei- 
tungseinhelten (5) festsetzenden Programms verfugbare Programmspeicher (11) umfassen. 

60. Unter Verwendung eines Informationen verarbeitenden Systems (2) durchgefQhrtes Recnenverfahren nach An- 
spruch 30, dadurch gekennzeichnet, daft 

die Verarbeitungseinhelten (6) die Rechenoperationen under Verwendung ein und des selben Rechenpro- 
gramms durchfOhren, 

das Rechenprogramm die zugehdrlgen Datenspelcher (8) der Elementarprozessoren (3) mrt Intemen Punkt- 
bits, die anzelgen, daft die Gitterpunkte Innemalb eines Rechenbereichs in dem Feld Oder aufterhalb des 
ObJeWs in dem Rechenbereich bezQglich alter Gitterpunkte angeordnet sind, mit den extemen Punktbits, die 
anzelgen, daft die Grtterpunkte aufterhalb des Rechenbereichs Oder limerhalb eines Objekts In dem Rechen- 
bereich angeordnet sind, und mit Randbedingungsbits, die Informationen Ober Randbedingungen bezQglich 
der am Rand des Rechenbereichs angeordneten Gitterpunkte anzeigen, im voraus laden, und 
das Rechenprogramm die verarbeitung auf den Elementarprozessoren (3) gemdft den intemen Punktbits, 
den extemen Punktbits und den Randbedingungsbits findert 

61. Unter Verwendung eines Informationen verarbeitenden Systems (2) durchgefQhrtes Recnenverfahren nach An- 
spruch 30, dadurch gekennzeichnet, daft 

jede der Bearbeltungselnhelten (6) basierend auf einem Datenfluftsystem arbertet, das slmuftane AusfQhrung 
einer Mehrzahl der verschiedenen Bereichen jedes Programmspeicfiers (11) zugeordneten Prog ram men er- 
m6gDcht f und 

Jede der Bearbeltungselnhelten (5) die Rechenoperationen einer Mehrzahl der Gitterpunkte simultan durch 
zuordnen einer Mehrzahl von Programmer! zum Bearbeften der jeweiligen Gitterpunkte zu verschiedenen 
Bereichen jedes Programmspeichers (11) ausfuhrt. 

52. Unter Verwendung eines Informationen verarbeitenden Systems (2) durchgefuhrtes Recnenverfahren nach An- 
spruch 51, dadurch gekennzeichnet. daft 
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das DatenfluQsystem ©In dynamisches datengesteuertes System enthfilt, das zum simultanen AusfOhren ein 
und des selben Programms mft Hngangsdaten einer MehrzahJ von Sfitzen unter Verwendung von Kennzelch- 
nern fghlg ist 

63. Unter Verwendung elnes Intbrmatlonen verartoeitenden Systems (2) durchgefQhrtes Rechenverfahren nach An- 
spruch 52, dadurch gekennzeichnet, daft 

die Verarbeitungseinheiten (6) die Adresse der zugehdrigen Datenspeicher (8), die benachbarte Elementar- 
prazessoren (3) betreffende Daten speichem, berechnen und die zugehdrigen Datenspeicher (8) unter Ver- 
wendung der Adresse der zugehdrigen Datenspeicher (8) auf der Gmndlage des Werts des den Eingangs- 
daten zugeordneten Kennzeichners referenzJeren und erneuern. 

54. Unter Verwendung elnes Informationen verartoeltenden Systems (2) durchgefQhrtes Rechenverfahren nach An- 
spruch 30, dadurch gekennzeichnet, daft 

die Verarbeitungseinheiten (5) Jewells festverdrahtete Schaitungen (13) haben, in denenarithmetisch-logische 
Elemente (14) in einer auf einem Datenfl ufSgraph entspechend der verdrahteten Logik fQr den teilweisen Oder 
ganzen Datenverarbeitungsfluft basierenden Gestalt verdrahtet sind. 

55. Unter Verwendung elnes Informationen verarbeitenden Systems (2) durchgefQhrtes Rechenverfahren nach An- 
spruch 30, dadurch gekennzeichnet, daft 

die Verarbeitungseinheiten (6) Jewells festverdrahtete Schaitungen (13) nach ef n und der selben verdrahteten 
Logik haben, 

die festverdrahteten Schaitungen die zugehdrigen Datenspeicher (8) der Elementarprozessoren (3) mit inter- 
nen Punktbite, die anzelgen, daft Qitterpunkte innertiaJb eines Rechenbereichs in dem Feld Oder aufterhalb 
des Objekts In dem Rechenbereich beztigiich aller Gitterpunkte angeordnet sind, mit den extemen Punktbits, 
die anzelgen, daft die Gitterpunkte aufierhalb des Rechenbereichs Oder innerhaJb elnes Objekts in dem Re- 
chenbereich angeordnet sind, und mit Randbedlngungsbits, die Informationen Ober Randbedingungen bezOg- 
Dch der am Rand des Rechenbereichs angeordneten Gitterpunkte anzelgen, im voreus laden, und 
die festverdrahteten Schaitungen (13) die Verarbettung des Datenfluftgraphs gemafi den intemen Punktblts, 
den externen Punktbits und den Randbedlngungsbits andem. 

56. Unter Verwendung eines Informationen verarbeitenden Systems (2) durchgefuhrtes Rechenverfahren nach An- 
spruch 54, dadurch gekennzeichnet, daft 

die Verarbeitungseinheiten (5) Jewells arlthmetJsche Vorrtchtungen (10) zum direkten AusfOhren von Opera- 
tlonscodes zum kumuiatfven Addleren der jeweiligen Ergebnisse von Multipllkation von erstem Eingang mft 

zweltem Eingang, drittem Eingang mit viertem Eingang, funftem Eingang mit sechstem Eingang (2 x N - 

1>4em Eingang mit (2 x N)-tem Eingang zu 2N Eingangsdaten haben. 

57. Unter Verwendung eines Informationen verarbeitenden Systems (2) durchgefuhrtes Rechenverfahren nach An- 
spruch 54, dadurch gekennzeichnet daft 

das Informationen verarbeitende System (2) die Haufigkeit der Austauschvorgange an alien Gitterpunkten 
durch DurchfOhren von leeren Auatauschvorgfingen auch fQr die Gitterpunkte mit gertnger Hflufigkeit von Aus- 
teuschvorgangen zum Ldsen der mindestens einen besUmmenden Gleichung des Feldes niveJIiert, well die 
das Objekt der Berechnung bildenden Gitterpunkte am Rand von Bereichen gefonden warden. 

58. Unter Verwendung elnes Informationen verarbeitenden Systems (2) durchgefuhrtes Rechenverfahren nach An- 
spruch 30, dadurch gekennzeichnet daft 

das Informationen verarbeitende System (2) die Hflufigkeit der Operationen an alien Gitterpunkten durch 
DurchfOhren von Scheinoperartionen auch fQr Gitterpunkte mit gertnger Operationshaufigkeit zum Ldsen der 
mindestens einen bestlmmenden Gleichung des Feldes nlvelllert, weil die das Objekt der Berechnung bilden- 
den Gitterpunkte am Rend von Bereichen gefonden werden. 

69. Unter Verwendung elnes Informationen verarbeitenden Systems (2) durchgefQhrtes Rechenverfahren nach An- 
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spruch 30, dadurch gekennzeichnet, daft 

das Informatlonen verarbeitende System (2) ein Bearbeiten jedes Gitterpunkts zum Finden einer Ldsung min- 
destens einer DlfferentJalglelchung mit der Methode der flniten Dtfferenzen und AustauschvorgSnge zwlschen 
o den Jeweillgen Qltterpunkten durchfOhrt, Informatlonen, welche die Notwendlgkeit Oder rtfcht einer Kommunl- 

kation und die Austauschmethode fQr Jeden Gitterpunkt anzeigen. 

60. Unter Verwendung elnes Informatlonen verarbeitenden Systems (2) durchgefuhrtes Rechenverfahren nach An- 
spruch 30, dadurch gekennzeichnet, daft 

10 

das Informatlonen verarbeitende System (2), das ein Bearbeiten Jedes Gitterpunkts zum Rnden einer Ldsung 
mlndestens einer Differentiatgleichung mit der Methode der finiten Drfferenzen durchfQhrt, jedem Gitterpunkt 
elnen Spelcherberelch zuweial, Austauschvorgfirtge zwtschen Informatlonen von benachbarten Qltterpunkten 
dlrch den Spelcherberelch durchfuhrt und Jeden der Gitterpunkte mit Informatlonen zum Anzeigen des Spel- 
ts cherbereichs versorgt, der zur fur die KommunOcation in jeder Richtung, In der Kommunikation erfolgen solite, 
vorgesehen 1st. 

61. Unter Verwendung elnes Informatlonen verarbettenden Systems (2) durchgefuhrtes Rechenverfahren nach An- 
spruch 60, dadurch gekennzslchnet, daft 

20 

dm Informatlonen verarbeitende System (2) jeden der Gitterpunkte mrt Information verscrgt, die einen ellge- 
melrten Leerbereich In der Richtung, In der Kommunikation unmdgllch 1st, anzelgt 

62. Unter Verwendung elnes Informatlonen verarbettenden Systems (2) durchgefuhrtes Rechenverfahren nach An- 
2 * spruch 30, dadurch gekennzeichnet, daB 

wenn die benachbarten Gitterpunkte Qber verschtedene Elernerttarprozessoren (3) zugeordnet werden, die 
Elementarproze8soren (3) auf belden Seiten zwei Datenpaare rechtzeitig (Or eine Zwlschenkommunlkation 
halten und, wenn die Daten fOr diese Gitterpunkte emeuert worden slnd, die Prozessoren (3) die Daten nterrt 
30 wledereinschrelben, falls irgendeine Verschiebung der emauerten Daten hdchstens so grott wie der vorge- 

schrtebene Wert Im Vergleich zu den Daten des chronologisch vorhergehenden Paars 1st 

63. Unter Verwendung elnes Informatlonen verarbeitenden Systems (2) durchgefuhrtes Rechenverfahren nach An- 
spruch 60, dadurch gekennzeichnet, daft 

35 

das informatlonen verarbeitende System (2) den zugehdrigen Datenspeichem (8) virtu elle Datenbereictie zu- 
weist die Gitterpunkte simulleren, welche dem Rand des das Objekt der Berechnung der mlndestens elnen 
vorgeschriebenen Glelchung darsteilenden Rechenberelchs am nSchsten beflnden und auGerhalb des Re- 
chenbereichs Hegen. 

40 

S4. Unter Verwendung elnes Informatlonen verarbeitenden Systems (2) durchgefuhrtes Rechenverfahren nach An- 
spruch 30, dadurch gekennzeichnet, daft 

das Informatlonen verarbeitende System (2) eine Feinknotenkopplung (7a), die die gegenseitig benachbarten 
45 Elementarprozessoren (3) miteinander verbindet, und eine Grobknotenkopplung (7b), die die Elemerrtarpro- 

zessoren (3) mlteinander durch Uberaprlngen mlndestens elnes der Qemenitarprozessoren (3) dazwischen 
verbindet, aufweist 

66. Unter Verwendung elnes Informatlonen verarbeitenden Systems (2) durchgefuhrtes Rechenverfahren nach An- 
50 spruch 64, dadurch gekennzeichnet, daft 

das Informatlonen verarbeitende System (2) die Berechnung der mlndestens einen vorgeschriebenen Glei- 
churtg durch abwechselndes AusfQhren einer Berechnung durch die durch die Grobknotenkopplung (7b) ver- 
bundenen Elementarprozessoren (3) und Berechnung durch die durch die Feinknotenkopplung (7a) verburv- 
55 denen Elementarprozessoren (3) fortfuhrt. 

66. Unter Verwendung elnes Informatlonen verarbeitenden Systems (2) durchgefuhrtes Rechenverfahren nach An- 
spruch 30, dadurch gekennzeichnet, daft 
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das Informetionen verarbeilende System (2) die Berechnung der mindestens einen bestimmenden Gleichung 
des Feides unter Verwendung nur elnes Tells der ausgewihlten Gitterpunkte parallel zur Berechnung der 
mlndestens elnen bestimmenden Gleichung des Feides unler Verwendung aller Gitteipunkte durchfuhrt 

67. Unter Verwendung elnes Informetionen verarbeitenden Systems (2) durchgeftlhrtes Rechenverfahren nach An- 
spruch 30, dadurch gekennzeichnet, daft 

sich der Algoritfimus zur Berechnung der mlndestens einen bestimmenden Gleichung des Feides, der nur 
ausgewahtte Gitterpunkte verwendet, von dem Algorithmus zur Berechnung der mindestens einen bestim- 
menden Gleichung des Feides unterscheidet, der alle Gitterpunkte verwendet 

68. Unter Verwendung elnes Informetionen verarbeitenden Systems (2) durchgefuhrtes Rechenverfahren nach An- 
spruch 67, dadurch gekennzeichnet, daft 

das Informatlonen vererbeitende System (2) mindestens zwei gegenseitig benachbarte Gitterpunkte betref- 
fende Detenpaare in zeitlicher AbfoJge hat und die Operation der Gitterpunkte abbricht, wenn die Summe der 
Schwankung zwiscnen den Datenpaaren nicht grofier 1st a Is der vorgeschriebene Wert 

66. Unter Verwendung elnes Informetionen verarbeitenden Systems (2) durchgefuhrtes Rechenverfahren nach An- 
spruch 66, dadurch gekennzeichnet. daS 

der Wert der vorgeschriebenen Summe fOr das Untertassen der Operation der Gitterpunkte, der vom Wert der 
Schwankung zwiscnen zwei Datenpaaren abhangt, gemSfi dem Grad der Konvergenz reduziert wird. 

70. Unter Verwendung elnes Informetionen verarbeitenden Systems (2) durchgefuhrtes Rechenverfahren nach An- 
spruch 30, dadurch gekennzeichnet, daft 

die Elementarprozessoren (3) Jewells Ausgabeelnhelten (91) zum asynchronen Lesen von Ausgabedaten, 
Ausstatten dieser Ausgabedaten mit zum Anordnen und Sortleren von Daten notwendigen Kennzelchnem 
und Senden von Daten nach auGemalb der Elementarprozessoren (3) haben. 

71. Unter Verwendung elnes Informatlonen verarbeitenden Systems (2) durchgefOhrtes Rechenverfahren nach An- 
spruch 70, dadurch gekennzeichnet, daft 

das Informatlonen verarbeitende System (2) eine Ausgabedaten-Bildverarbeitungseinhert (97) zum Sammein 
von Ausgangsdaten von den Ausgabeeinheiten (91) der Elementarprozessoren (3) und Erzeugen von sicht- 
baren Blldem der Ausgangsdaten In Echtzelt, basterend auf den asynchron von alien Elementarprozessoren 
(3) elntreffenden Ausgangsdaten, eufweist. 

72. Unter Verwendung elnes Informatlonen verarbeitenden Systems (2) durchgefOhrtes Rechenverfahren nach An- 
spruch 30, dadurch gekennzeichnet, daft 

das Informetionen verarbeitende System (2) eine Koofdinatenumwandlungseinhert (120) zum Umwandeln 
elnes ersten, von dem die Elementarprozessoren (3) steuemden Hauptrechner (1) verwendeten Koordina- 
tensystems in ein zweites, von den Elementarprozessoren (3) verwendetes Koordinatensystem, wenn Daten 
vom Hauptrechner (1) zu den Elementarprozessoren (3) Obertragen warden, und Umwandeln des zwelten, 
von den Elementarprozessoren (3) verwendeten Koordinatensystems in das erste, vom Hauptrechner (1) 
verwendeten Koordinatensystem, wenn Daten von den Elementarprozessoren (3) zum Hauptrechner (1) Ober- 
tragen werden, aufWelst. 

73. Unter Verwendung elnes Informetionen verarbeitenden Systems (2) durchgefuhrtes Rechenverfahren nach An- 
spruch 30, dadurch gekennzeichnet, daft 

die Mehrzahl von Elementarprozessoren (3) eine Mehrzahl von Elementarprozessoren (3) ist, die integral von 
elnem Hauptrechner (1) gesteuert werden, 

erste in jedem der Elementarprozessoren (3) berertgesteDte und von den Verarbeitungseinherten (5) unab- 
hangige Konvergenzbeurte Hung seinheiten ( 1 83) zur Beurteilung der Konvergenz ohne Stdren der Berechnung 
der Verarbeitungseinherten (5) aufWeisen, und 
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das Informationen verarbeitende System (2) gekennzeichnet 1st durch 

erste Kommunikationswege (181) zum Teilen einer Mehrzahi der Elementarprozessoren (3) in eine Mehr- 
zahl von Gruppen und zum Verbinden der eraten Konvergenzbeurteilungselnhelten (183) der Jeweillgen 
Elemenrtarprozessoren (3) in Form elner unidlrektlonai geschlosenen Schlelfe Oder in elner zentralislerten 
Form in den jeweillgen Gruppen, und 

zweite Kommunikationswege (182), die mit jedem der ersten Kommunikationswege (181) der Gruppen 
verbunden sind und alle Konvergenzbeurteilungseinhelten (183), die die Ergebnisse der Konvergenzbe- 
urteilung aller Konvergenzbeurteilungseinhelten (183) in den Gruppen und aller Konvergenzbeurteilungs- 
einhelten (183) der Gruppen tntegrieren, in Form einer unidirektional geschlosenen Schlelfe oder in einer 
zentrallsierten Form verblnden. 

74. Unter Verwendung elnes Informationen verarbeitenden Systems (2) durchgefflhrtes Rechenverfahren nach An- 
spruch 30, dadureh gekennzeichnet, daft 

das Informationen verarbeitende System (2) mindestens einen Weg (181) zum Obertragen von Nachrichten 

elner Mehrzahi der Elementarprozessoren (3) aufWeist, und 

das Rechenverfahren welter gekennzeichnet 1st durch die Schritte 

(a) Beurteilen der Konvergenz auf Jedem der Elementarprozessoren (3) oder mindestens elnem Verar- 
beitungsmrttel (5). das Jedem der Elementarprozessoren (3) zugeordnet isfc 

(b) Senden elner eraten Nachricht, die elnen vorgeschrlebenen Elementarprozessor (3) oder eine Verar- 
beitungsvorrlchtung (6) des vorgeschrlebenen Elements (3) anzelgt, wenn das Informationen verarbei- 
tende System (2) die Konvergenz auf dem vorgeschrlebenen Elementarprozessor (3) oder dem Verar- 
beitungsmrttel (5) des vorgeschrlebenen Elements (3) feststellt 

(c) Senden elner zweiten Nachricht, die die Konvergenz auf dem die erste oder zweite Nachricht von dem 
benachbarten Elementarprozessor (3) ertialtenden Elementarprozessor (3) oder Verarbeitungsmittel (5) 
Oder dem benachbarten Elementarprozessor (3) nach dem die erste oder zweite Nachricht erhaltenden 
Elementarprozessor (3) oder Verarbeitungsmittel (5) anzelgt; und 

(d) Beurteilen der Konvergenz aller Elementarprozessoren (3) dutch Obertragen der zweiten Nachricht 
eine nach der anderen durch Wlederholen des Schrttts (c). 

75. Unter Verwendung elnes Informationen verarbeitenden Systems (2) durchgefuhrtes Rechenverfahren nach An- 
spruch 74, dadureh gekennzeichnet, daB 

das Informationen verarbeitende System (2) Schrftt (a) bis Sennit (d) fur eine Mehrzahi von Bestatigungen 
der Konvergenz aller Elementarprozessoren (3) wlederhoit 

76. Unter Verwendung eines Informationen verarbeitenden Systems (2) durchgefuhrtes Rechenverfahren nach An- 
s patch 76, dadureh gekennzeichnet, daft 

das Informationen verarbeitende System (2) ein zweites KonvergenzbeurteQungsmittel aufweist, 

der mindestens eine Weg eine Mehrzahi von verschiedenen, die Verarbeitungsmittel (5) verbindenden Wegen 

einschlieBt, und 

das zweite KonvergenzbeurteQungsmittel die Konvergenz alter Elementarprozessoren (3) auf mehrfache Wei- 
8e durch die eine nach der anderen Qber die Mehrzahi von verschiedenen Wegen Qbertragenen Nachrichten 
bestSUgt 

77. Unter Verwendung elnes Informationen verarbeitenden Systems (2) durchgefflhrtes Rechenverfahren nach An- 
spruch 76, dadureh gekennzeichnet, daB 

das Informationen verarbeitende System (2) eine Beurteilung der Konvergenz durch Abbrechen der Verarbel- 
tung aller Elementarprozessoren (3) oder der Verarbeitungsmittel (5) emeut erstellt, nachdem die Vollendung 
elner aligemelnen Konvergenz durch die die aDgemeine Konvergenz feststellenden Prazesse festgesteilt wor- 
dentet. 

78. Unter Verwendung elnes I nformatione n verarbeitenden Systems (2) mit einer Mehrzahi von Elementarprozessoren 
(3) durchgefuhrtes Rechenverfahren nach Anspruch 30, welter gekennzeichnet durch die Schritte: 
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Initlalisieren von Speichermitteln zum Spelchern der Ergebnisse der Konvergenzbeurteilung; und 
Feststellen von Konvergenz oder nlcht der Berechnung fOr die Gitterpunkte fOr die jeweiDgen von den Ete- 
mentarprozessoren (3) tibemommenen Gitterpunkte und, falls festgestellt wurde, daB Konvergenz nicht er- 
retehl wurde, Festlegen der Speichervorrlchtungen auf die vorgeschriebenen logischen Werte. 

79. Informatlonen verarbeitendes System (2) nach Anspruch 1 , dadurch gekenraeichnet, daB 

die Mehrzahl von Elementarprozessoren (3) diezugehorigen Datenspeicher (300) zwischen den benachbarten 
Elementarproze88oren (3) teltt, und 

jeder der Mehrzahl von Elementarprozessoren (3) elne mht elner Mehrzahl von mft den benachbarten Elemen- 
tarprozessoren (3) getellten Daten&peichem (300) gemeinsam verbundenen Elngabe-ZAusgabe-Kontrollelrv- 
heiten (301) zur Kontrolle des Elngangs/Ausgangs von Daten mil einem der Mehrzahl von Datenspeichem 
(300) durch UmschaJten der Kommunikationsrichtung gemifi einem Zugrfflsbedarf auftoeist. 

80. Informatlonen verarbeitendes System (2) nach Anspruch 1 , wetter gekennzeichnet durch 

elne Mehrzahl von In Ubereinstlmmung mft der Mehrzahl von Elementarprozessoren (3) bereitgestellten Re- 
glstem (305) zum vortibergehenden Spelchern der Ergebnisse der Konvergenzbeurteilung elner von jedem 
der Mehrzahl von Elementarprozessoren (3) atisgefuhrten Operation; und 

einer verdrahteten ODER-Schaltung zum Erzeugen elnes logischen Produkts des Ausgangs elner Mehrzahl 
der Register (305). 

81. Unter Verwendung elnes Informatlonen verarbeltenden Systems (2) durchgefuhrtes Rechenverfahren nach An- 
spruch 30, dadurch gekennzeichnet, daft 

elne Mehrzahl der Elementarprozessoren (3) die zugehdrigen Datenspeicher (300) zwischen den benachbar- 
ten Elementarprozessoren (3) teirt, 

Jeder der Mehrzahl von Elementarprozessoren (3) eine mil elner Mehrzahl von mft den benachbarten Elemen- 
tarprozessoren (3) getellten Datenspeichem (300) gemeinsam verbundenen Elngabe-ZAusgabe-KontroUelrv 
heiten (301) zur Kontrolle des Elngangs/Ausgangs der Daten mi! einem der Mehrzahl von Datenspeichem 
(300) durch Umschalten der Kommunikationsrichtung gem&fi einem Zugrfflsbedarf auftoeist, und 
jeder der Mehrzahl von Elementarprozessoren (3) den Inhalt des Datenspeichere durch die Eingabe-ZAusga- 
be-Kontrolleinhelt (301) referenzJert 

82. Unter Verwendung elnes Informatlonen verarbeltenden Systems (2) durchgefQhrtes Rechenverfahren nach An- 
spruch 30, dadurch gekennzeichnet daft 

das Informationen verarbeitende System (2) die Ergebnisse der Konvergenzbeurteilung des korrespondieren 
Elementarprozessors (3) In der Mehrzahl von in Qbereinstimmung mit der Mehrzahl von Elementarprozesso- 
ren (3) bereitgestellten Reglstem (305) vorQbergehend spetehert, und Operattonen ausfOhrt, wahrend es das 
bglsche Produkt des In elner Mehrzahl der Register (305) gespeicherten Ergebnisse der Beurteilung der Kon- 
vergenz mit der verdrahteten ODER-Schaltung nimmt. 

83. Unter Verwendung elnes Informationen verarbeltenden Systems (2) durchgefQhrtes Rechenverfahren nach An- 
spruch 30, dadurch gekennzeichnet, daft 

die Anzahl der Gltterpunkte fQr einen Elementarprozessor (3), der instabile Gitterpunkte zur Analyse, in der 
es instabile Faktoren gibt, enthfirt, kleiner ist als die Anzahl der Gitterpunkte fur einen Elementarprozessor 
(3), der kelne Instabllen Gitterpunkte zur Analyse enthalt. 



Revendlcations 

1. Systeme de traitement ^informations (2), comprenant : 

una plurality de processeurs d'element (3) ayant respectivement des unites de traitement (5) pour reeliser 
des operations de calcul et des memoires de donnees associees (8, 76 ; 300) pour stocker des donnees 
concemant lesdttes operations de calcul pour reallser le calcul en asslgnant, en ce qui concern e au moins 
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une Equation d'administration d'un champ qui peut etre decrit dans au moins una Equation prescrite en utilisant 
ladito pluraifte de processeurs d'element (3), una pluraJlte da points da reseaux obtenus en dtvlsant spatiale- 
mant I adit champ pour lesdits processeurs d'element (3) ; et 

una plurallte de bus Indlviduels de donnees (7) pour avoir des communications mutueOes desdites donnees 
s entre ceux adjacents desdlts processeurs d'element (3) ; ou 

I adit calcul est realise en faisant fonctionner lesdits processeurs d'element (3) de facon asynchrone et 
Ind6pendante en utilisant lesdites donnees transmises via lesdits bus de donnees (7) ; 
I edit systems de traite merit dlnformatlons (2) 6tant caract6r1s6 en co que chacun desdlts processeurs 
*o d'element (3) ecrit ou lit dlrectement dans ou a partir desdlts memoires de donnees associees (8 ; 76 ; 

300) des processeurs d'element adjacents (3) via un desdlts bus indlviduels de donnees (7) pour une 
communication antra eux. 

2. Systeme de traitement d'i informations (2) selon la revendication 1, dans lequel ladite memoire de donnees asse- 
ts ciees (8, 76, 300) de chaque processeur d'element respect rf (3) comprend une memoire de donnees (8) dans ledrt 

processeur d'element respactif (3). 

3. Systeme de traitement ^informations (2) selon la revendlcatlon 1, dans lequel ladite memoire de donnees asso- 
ciees (6, 78, 300) de chaque processeur d'6l6ment respectif (3) comprend une memoire partagee (76) disposee 

20 en dehors du dit processeur d'element respectif (3) le long (fun chemln de donnees entre ledit processeur d* element 
respectif (3) et un processeur d'element (3) adjacent a celui-cl. 

4. Systeme de traitement ^informations (2) selon la revendfcaUon 1 , dans lequel ladite memoire de donnees asso- 
ciees (8 ; 76 ; 300) de chaque processeur d'element respectif (3) contient des donnees dupliquees des processeurs 

25 d'element adjacents select tonne (3). 

6. Systeme de traitement conformations (2) selon la revendlcatlon 1 , dans lequel une pluralit6 d*6tapes temporelles 
correspondant a cedes Hentiques desdlts points de reseaux sont en outre assignees auxdits processeurs d^leV 
ment(3). 

30 

8. Systeme de traitement dlnformatlons (2) selon la revendlcatlon 6, dans lequel une pluralite de plans Wdlmenslon- 
nels (18), dans lesquels lesdits processeurs d'element (3) realisarrt le traitement desdlts points de reseaux obtenus 
en divisant spatialement un champ bidimensionnel sont disposes bidimensionnellement, sont disposes le long de 
la base de temps a construire trldimensionnellement, lorsqu'une solution anatytique bidimensionnelle est recher- 

36 chee, et lesdits processeurs <f6lement (3) sont constnjits en pipeline dans la direction de la base de temps. 

7. Systeme de traitement ^Informations (2) selon la revendlcatlon 1 , comprenant en outre un moyen de transmission 
(4) pour transmettre des conditions Initiates et des conditions de llmJte auxdits processeurs d'element (3). 

to 8. Systeme de traitement dlnformatlons (2) selon la revendlcatlon 1, dans lequel des communications de donnees 
entre lesdits processeurs d'element adjacents (3) sont realisees individuellement pour chacun desdlts processeurs 
d'element (3), de facon asynchrone et repetitive. 

9. Systeme de traitement dlnformatlons (2) selon la revendication 1, dans lequel des communications de donnees 
4* entre lesdits processeurs cfel6ment adjacents (3) sont des communications de message et realisees avec un flux 

de multiples instructions et un flux de multiples donnees. 

10. Systeme de traitement dlnformations (2) selon la revendlcatlon 1, dans lequel lesdits processeurs d'element (3) 
peuvent so referer dlrectement, respecttvemerrt, a ladite memoire de donnees associees (8, 78, 300) desdlts 

60 processeurs d'element adjacents (3). 

1 1 . Systeme de traitement dlnformatlons (2) selon la revendlcatlon 1 0, ou lesdits processeurs d'element (3) peuvent 
se referer et renouveler dlrectement, respect rvement, lesdites donnees associees (8, 76, 300) desdlts processeurs 
d'element adjacents (3). 

09 

12. Systeme de traitement dlnformations (2) selon la revendication 1, dans lequel lesdits me moires de donnees as* 
sociees (8, 76, 300) comprennent, sur les routes (7, 302, 303) reliant lesdits processeurs d'element adjacents (3), 
des premieres memoires (76, 300) partagees par lesdits processeurs d'element adjacents (3) et peuvent se referer 
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et renouveler diredement ie oontenu stocke dens lesdites premieres memoires (76, 300) a partir cfune pturalite 
desdits process eu re d , 6!6ment (3) raccordes auxdHes premieres memoires (76, 300). 

13. Systeme de traltement dlnformations (2) seton la revendicatlon 10, dans lequel lesdtts memoires de donnees 
o assodees (8) comprennent des memoires multiports (8, 76a). 

14. Systeme de traltement dlnformations (2) selon la revendicatlon 1 , comprenant en outre un moyen de tampon (80, 
304) pour mettre en tampon temporal rement une demande d'acces auxdites memoires de donnees associees (8, 
78; 300). 

10 

15. Systeme de traltement ^informations (2) seton la revendicatlon 1 , dans lequel : 

lesdtts processeurs cfelement (3) ont respectlvement des mdmcires tampons (76a) dans lesquelles les don- 
nees envoyees par lesdlts processeurs cfelement adjacents (3) sont ecrltes comme des donnees avec un 
™ nom d'objet dans des communications entre lesdits processeurs d'etement adjacents (3), et 

lesdtts processeurs d'etement (3) completer* la reception des donnees 6 partir desdits processeurs clement 
adjacents (3) en recherchant des donnees avec un nom d'objet d une temporisation optionnelle k partir des- 
dites memoires tampons (76a). 

*o 16. Systeme de traltement ^informations (2) selon la revendicatlon 1, dans lequel : 

lesdlts processeurs cfelement (3) ont respectlvement des memoires d'adaptation (83) pour attendre lesdites 
donnees necessaires pendant au moins une desdites operations de calcul, et 

tadfte unitd de traltement (5) dans chacun desdits processeurs d'element (3) realise au moins une operation 
25 prescrfte entre lesdites operations de calcul 6 r instant dans une sequence tempore! Ie lorsque les donnees 

necessaires sont disportibles dans ladite memoire d'adaptation (83) dans celfe-ci. 

17. Systeme de traltement cflnformations (2) selon la revendicatlon 1, dans lequel lesdits processeurs d'element (3) 
ont respect! vement des memoires de programme (11) dlsponibles pour re-ecrire Ie programme stipulant les ope- 

30 rations desdites unites de traltement (5). 

18. Systeme de traltement dlnformations (2) selon ta revendicatlon 1, dans lequel : 

chacune desdites unites de traltement (5) travaille sur la base cfun systeme de flux de donnees qui permet 
3* f execution slmultanee cfune pluralite desdits programmes assignes & differentes zones de chacune desdits 

memoires de programme (11), et 

chacune desdites unites de traltement (5) execute lesdites operations de calcul cfune plurality desdits points 
de reseaux slmultanement en assignant une pluralite de programmes pour traitor lesdits points de reseaux 
respectifs aux differentes zones de chacune desdits memoires de programme (11). 

40 

19. Systeme de traltement conformations (2) selon la revendicatlon 18, dans lequel ledit systeme de flux de donnees 
oomprend un systeme dynamique commande par des donnees capable cf executor slmultanement un mime pro- 
gramme sur des donnees d'une plurelite de jeux avec rutilisation d'identffieurs. 

45 20. Systeme de traltement conformations (2) selon la revencflcatton 1 , dans lequel lesdites unites de traltement (5) ont 
respectlvement des circuits materiel* (13) dans lesquels des elements arithmeliques logiques (14) sont cdbles 
dans ta forme bases sur un graph© de flux de donnees selon la iogique cfibtee pour une partie ou rensemble du 
flux de traltement de donnees. 

so 21 . Systeme de traltement conformations (2) selon la revendicatlon 1 , dans lequel : 

lesdites unites de traltement (5) ont respectlvement des circuits materlels (13) selon une mdme Iogique cfiblee, 
lesdits circuits materials (13) charge nt I'avance lesdites memoires associees (6) desdits processeurs cfele- 
ment (3) avec des bits de points internes indlquant que lesdtts points de reseaux sont position nes d llnterieur 
35 cfune zone de calcul dans ledit domaine ou & I'exterieur dudit objet dans ladite zone de calcul par rapport d 

tou8 lesdits points de reseaux, avec lesdits bits de points extern es indlquant que lesdits points de reseaux 
sont posrtionnes en dehors de ladite zone de calcul ou ft Pinterieur cf un objet dans ladite zone de calcul, et 
avec des bits de condition de llmlte indlquant des informations de condition de limlte par rapport auxdits points 



85 



EP 0 696 001 B1 

de reseaux position nes 6 la limite de lad He zone de calcul, et 

lesdits circuits materials (13) changent le traftement du graprte du flux de donnees selon lesdits bits de points 
internes, lesdits bits de points extern es et lesdits bits de conditions de limite. 

8 22. Systems de traitement dlnformatlons (2) selon la revendlcatlon 20, dans lequel lesdites unites de traltement (6) 
ont respectivement en outre des disposittfs arithntetiques (10) pour executor dire dement des codes d'operation 
pour edditionner cumulative ment des resultats respectifs de multiplication de la premiere entree par la seconde 
entree, la trolsi&me entree par la quatrieme entree, la clnquieme entree par la sixieme entree la (2xN-1) &TO 
entree par la (2 x N)*™ entree pour 2N Elements de donn6es d'entnee. 

10 

23. Systeme de traltement conformations (2) selon la revendlcatlon 1 comprenant en outre des oouplages fins de 
noeuds (7a) reliant lesdits prooesseurs cfeldment adjaoente (3) les uns aux eutres et des oouplages grossiers de 
noeuds (7b) reliant lesdits processes* dement (3) les una aux autres en ne sautant pas molris qu'un desdits 
processeure d* element (3) entre eux. 

IB 

24. Systeme de traltement (^informations (2) selon la revendication 1 ou lesdits prooesseurs d'element (3) ont res- 
pectivement des unites de sortie (91) pour extralre de fapon asynchrone des donnees de sortie, fbumissant k ces 
donnees de sortie des fdentifleure necessaJres pour disposer et trier les donnees et envoyer des dortftees d I'ex- 
terteur desdits processeura cf element (3). 

20 

25. Systeme de traitement deformations (2) selon la revendication 24 comprenant en outre una unite de traitement 
d'lmage de donn ees de sortie (97) pour col lecter des donnees de sortie A partir desdits unites de sortie (91 ) desdits 
processeure d'elemerrt (3) et pour g6rterer des images vlsibies desdites donnees de sortie en temps reel sur la 
base des donnees de sortie arrivant de fecon asynchrone a partir de tous lesdits processeure d'dtement (3). 

26. Systeme de traitement cf Informations (2) selon la revendication 1, comprenant en outre une unite de conversion 
de coordonnees (120) pour convertir un premier systeme de coordonnees utilise" par un ordinateur central (1) 
commandant lesdits processeure d'efement (3) en un second systeme de coordonnees utilise par lesdits proces- 
seure d'elGment (3) loreque des donnees sont transferees dudit ordinateur central (1) vers lesdits processeure 

30 d'element (3), et pour convertir ledit second systems de coordonnees utflisd par lesdits processeure d'element (3) 

dans ledit premier systeme de coordonnees utilise par (edit ordinateur central (1) loreque les donnees sont trans- 
ferees & partir desdits processeure d'6lement (3) audit ordinateur central (1). 

27. Systeme de traltement conformations (2) selon la revendication 1, dans lequel 

** 

tadite plurality de processeure cf 6l6ment (3) commande de fagon integree par rordinateurcentral (1) comprend 
respectivement des premieres unites de test de convergence (183), fbumies dans chacun des processeure 
cf element (3) et Independamment desdites unites de traitement (5), pour tester la convergence sans perturber 
le calcul des unites de traitement (5), et 
40 ledit systeme de traltement cf informations (2) comprenant en outre : 

des premieres routes de communication (181) pour cfiviser une pluralrte desdits processeure cf element 
(3) en une pluraOte de groupes et en reliant lesdites premieres unites de test de convergence (183) desdits 
processeure cfelSment (3) sous la forme cf une boucle fermee unidlrectionnelle ou dans une forme cen- 
45 tralisde dans lesdits groupes respectifs ; 

des seconded unites de test de convergence (180) raccordees auxdites premieres routes de communi- 
cation (181) respectivement, pour Integrer les resultats de test de convergence de toutes lesdites premie- 
res unites de test de convergence (183) dans lesdits groupes ; et 

des 8econdes routes de communication (162) pour raccorder toutes les second es unites de test de con- 
50 verge nee (160) auxdits groupes sous la forme cf une boucle fermee unidlrectionnelle ou dans une forme 

centralist. 

28. Systeme de traltement cf informations (2) selon la revendication 1, comprenant en outre : 

55 au molns une route (181) pour transferer des messages cfune pturalite desdits processeure d'element (3) ; 

un premier moyen de test de convergence (183) pour tester des convergences sur lesdits processeure d'ete- 
ment (3) ou pas moins qu'un moyen de traitement (5) assigne & chacun desdits processeure cf6l6ment (3) ; 
un premier moyen d'envol de messages (183) pour envoyer un premier message indiquant la convergence 
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8ur un prooesseur d'element prescrit (3) ou sur le moyen de traitement (5) dudft processeur d'dlement present 
(3) lorsque ledht premier moyen de test de convergence (183) teste la convergence sur ledit processeur d'ele- 
ment prescrit (3) ou ledft moyen de traitement (5) de celul-ci ; 

un second moyen d'envol de messages pour envoyer un second message Indiquant la convergence sur le 
b processeur d'element (3) ou le moyen de traitement (5) reoevant I edit premier ou second message du pro- 

cesseur d'element adjacent (3) ou le processeur d'element adjacent (3) apres que tedit processeur cf element 
(3) ou ledft moyen de traitement (5) a regu ledit premier ou second message ; et 

un second moyen de test de convergence pour tester la convergence de tous lesdits processeurs cf element 
(3) en transterant (edit second message run apres I'autre ft travers au moins une dite route par ledit second 
10 moyen d'envol. 

29. Systeme de traitement ^informations (2) selon la revendication 28, dans lequel 

au molns une dite route comprend une plurality de dlfferentes routes reliant ledit moyen de traitement (5), et 
™ ledit second moyen de test de convergence confirms ladrte convergence de tous lesdrts processeurs cfelement 

(3) dans des cherrdns multiplexes par las messages transferees les uns apres les autres a travers ledite plurality 
de routes dlfferentes. 

30. Procede de calcul realise en utilisant un systeme de traitement ^informations (2) dans lequel 

20 

ledit systeme de traitement cf informations (2) comprend : 

une pluralite de processeurs d'etemertt (3) ayarrt respectivement des unites de traitement (6) pour rfealiser 
des operations de calcul et des memoires de donnees assodees (8, 76, 300) pour stocker des donnees 
25 con cement lesdrtes operations de calcul pour realiser un calcul en assignant, en oe qui concerns au moins 

une equation d'administration (fun champ qui peut etre decrrt dans au moins une equation presents en 
utilisant ladite plurality de processeurs d'element (3), une pluralitd de points de reseaux obtenus en divi- 
sant spatialemerit ledit champ pour lesdits processeurs d'element (3) ; et 

une plurality de bus individuals de donnees (7) pour avoir des communications mutuetles desdites don- 
30 nees entre ceux adjacents dead its processeurs cfelement (3), ou 

ledit calcul est realisd en faisant fonctionner lesdits processeurs if 6l6ment (3) de facon asynchrone et Ind6- 
pendante en utilisant lesdites donnees transmises via lesdrts bus de donnees (7) 

ledit precede de calcul etant caraderise en oe que chacun desdits processeurs d'element (3) toft ou Gt 
3S directement dans ou & partir desdits memoires de donnees assodees (8, 76, 300) des processeurs cf element 

adjacents (3) & travers un desdits bus individuals de donnees (7) pour une communication entre eux. 

31. Precede de calcul realise en utilisant un systeme de traitement cf informations (2) selon la revendication 30, dans 
lequel ladrte memolre de donnees associee (8, 76, 300) de cheque processeur d'element respectif (3) comprend 

40 une memolre de donnees (8) dans ledit processeur tf element respectif (3). 

3Z Precede de calcul realise en utilisant un systeme de traitement deformations (2) selon la revendication 1 , dans 
lequel ladrte memolre de donnees associee (8, 76, 300) de cheque processeur d'element respedif (3) comprend 
une memolre partagee (76) disposee & Fexterieur dudit processeur d'element respectif (3) le long cfun chemin de 
4* donnees entre ledit processeur d'element respectif (3) et un processeur d'element (3) adjacents & celui-d. 

33. Precede de calcul realise en utilisant un systeme de traitement ^informations (2) selon la revendication 30, dans 
lequel des donnees dupiiquees des processeurs d'element adjacents selectionnes (3) sent ecrites dans une me- 
molre de donnees assodees (8, 76, 300) de cheque processeur d'element respectif (3). 

60 

34. Precede de calcul realise en utilisant un systeme de traitement ^informations (2) selon la revendication 30, dans 
lequel une pluralite d'etapes temporelles correspondent 6 celles Identiques desdits points de reseaux sent en outre 
asslgnes auxdlts processeurs d'element (3). 

55 35. Precede de calcul realisd en utilisant un systeme de traitement dWormations (2) selon la revendication 34, dans 
lequel une plurality de plans bidimensionnels (18), dans lesquels lesdits processeurs d'element (3) realisant le 
traitement desdits points de reseaux obtenus en divisant spatialemerit un champ bidimensionnel sont disposes 
bidimensionnellement, sont disposes le long de la base de temps & construire tridimensionnellement, lorsqu'une 
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solution anatytique bldimensionnelle est recherchee, et lesdits processeurs cf element (3) sont constructs en pipe- 
line dans la direction de la base de temps. 

36b Procede de calcul realise en titillsant un systeme de traitement dlnformations (2) selon la revendlcatlon 30, dans 
o lequel 

eu molns una dite equation prescrite oomprend au mains une equation differentielle, et 
au molns une dite equation prescrite obtient la solution pour au moins une dite Equation differentieDe en fiaisant 
tbnctionner lesdtts processeurs d* element (3) de faccn asynchrone et independarrte par un precede de diffe- 
10 re nee finle. 

37. Precede de calcul realise en utilisant un system© de traitement dlnformations (2) selon la revendication 30, dans 
lequel au molns une dite equation dlfferenttelle prescrite comprend au molns une equation difKrentielle partielle 
ou au molns une equation diflerentleile ordinaire. 

15 

38. Procede de calcul realise en utilisant un systeme de traitement ^informations (2) selon la revendication 30, dans 
lequel 

tesdtts prooesseurs ©"element (3) prennent en charge un calcul d*une pluralit6 de points de reseaux obtenus 
20 en dMsant spatlalement I edit champ respectivement, et 

te nombre correspondent desdits points de reseaux ou las adresses de base de la zone de donnees cones- 
pendant auxdlts points de reseaux sur lesdites memolres de donnees assoclees (8, 76, 300) sont utillsees 
comma des jetons d'entree pour commander led it calcul realist par lesdits processeurs d'eJemerrt (3). 

2 & 39. Procede de calcul realise en utilisant un systeme de traitement ^informations (2) selon la revendication 30, dans 
lequel ledit systeme de traitement cf informations (2) realise ledit calcul en utilisant le moyen de transmission (4) 
pour trart8mettre des conditions inrtlales et des conditions de Dmite auxdits processeurs cfelement (3). 

40. Procede de calcul rgallsd en utilisant un systeme de traitement dlnformations (2) selon la revendication 30, dans 
30 lequel des communications de donnees entre lesdits processeurs d'eJement 8djacents (3) sont realises individuel- 

lement pour chacun desdits processeurs cfelement (3) de fa con asynchrone et repetmvement 

41. Procede de calcul realise en utilisant un systeme de traitement ^informations (2) selon la revendication 30, dans 
lequel des communications de donnees entre lesdits processeurs d'element adjacents (3) sont des communica- 

3$ tions de messages et realises avec un flux de multiples instructions et un flux de multiples donn6es. 

42. Procede de calcul realise en utilisant un systeme de traitement dWormatlons (2) selon la revendication 30, dans 
lequel lesdits processeurs cfelement (3) se referent direct ement respectivement, auxdites memolres de donnees 
assoclees (8, 76, 300) desdits processeurs cfelement (3). 

40 

43. Precede de calcul realise en utilisant un systeme de traitement dWormations (2) selon la revendication 36, dans 
lequel lesdits processeurs d'element (3) se referent et renouvellent, directement, respectivement lesdites memol- 
res de donnees assoclees (8, 76, 300) desdits processeurs d'element adjacents (3). 

46 44. Procede de calcul realise en utilisant un systeme de traitement cf informations (2) selon la revendication 30, dans 
lequel lesdits memo! res de donnees assoclees (6, 76, 300) comprennent, sur les routes (7, 302, 303) reliant lesdits 
processeurs d'element adjacents (3), des premieres memolres (76, 300) partagees par lesdits processeurs d'ele- 
ment adjacents (3) et se referent et renouvellent directement, le contenu stocks dans lesdites premieres memoires 
(76, 300) & partlr d*une pluralite desdits processeurs d'&ement (3) raccordSs auxdites premieres memolres (76, 

60 300). 

45. Procede de calcul realisd en utilisant un systeme de traitement dlnformations (2) selon la revendication 42, dans 
lequel lesdits memolres de donnees associees (8) comprennent des memoires multiports (8 ; 76a). 

65 46. Procede de calcul realise en utilisant un systeme de traitement dlnformations (2) selon la revendication 30, dans 
lequel ledit systeme de traitement conformations (2) realise ledit calcul en utilisant un moyen de tampon (80 ; 304) 
pour tamponner temporairement une demande (faeces auxdits memoires de donnees assoclees (8, 76, 300). 
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47. Precede de calcul realise en utiiisant un system© de traitement dlnformations (2) selon la revendication 30. dans 
lequel 

lesdits processeurs d'6!6ment (3) comprennent respectivement des memoires tampons (76a) (tens lesquelles 
& des donnees envoyees par lesdits processeurs d'6l6ment (3) sent Sorites comme des donnees avec le meme 

objet dans des communications antra lesdits processeurs cfelement adjacents (3), et 
lesdits processeurs d 1 element (3) terminent la reception des donnees e partir desdits processeurs d 1 element 
adjacents (3) en recherchant les donn6es avec le nom d'objet & une temporisation optiormelle a partir desdits 
memoires tampons (76a). 

10 

48. Precede de calcul realise en utillsant un systeme de traitement dlnformations (2) selon la revendlcatlon 30, dans 
lequel 

lesdits processeurs d'element (3) comprennent respectivement des m6 moires d'adaptation (83) pour attendre 
™ lesdites donnees necessaires pendant au moins une desdites operations de calcul, et 

lad ite unite de traitement (5) dans chacun desdits processeurs d'element (3) realise nt au moins une operation 
prescrfte parmi lesdites operations de calcul £ r Instant ou les donnees necessaires sent disponibles dans 
ladite memolre d'adaptation (83) dans celle-ci. 

20 49. Precede de calcul realise en utillsant un systeme de traitement dlnformations (2) selon la revendication 30, dans 
lequel lesdits processeurs d'element (3) comprennent respectivement des memo I res de programme (11) disponi- 
bles pour re-ecrire le programme stipulant lesdites operations de calcul desdites unites de traitement (5). 

50. Precede de calcul realise en utiiisant un systeme de traitement d'informations (2) selon la revendication 30, dans 
20 lequel : 

lesdites unites de traitement (6) realisent lesdites operations de calcul en utillsant un m§me programme de 
calcul, 

ted it programme de calcul charge a 1'avance lesdites me moires de donnees associees (8) desdits processeurs 
30 d'element (3) avec des bits de points internes indiquant que lesdits points de reseaux sent posrtionnes d 

rinterieur d*une zone de calcul dans ledit champ ou a I'exterieur dudit objet dans ladite zone de calcul par 
rapport a tous lesdits points de reseaux, avec lesdits bits de points externes Indiquant que lesdits points de 
reseaux sont position nes en dehors de ladite zone de calcul ou & rinterieur cfun objet dans ladite zone de 
calcul, et avec des bits de conditions de Dmite indiquant des informations des conditions de limite par rapport 
& auxdits points de reseaux positionnes d la limite de ladite zone de calcul, et 

ledit programme de calcul change le traitement sur lesdits processeurs cTeJement (3) selon lesdits bits de 
points Internes, lesdits bits externes, et lesdits bits de conditions de limite. 

51. Precede de calcul realise en utiiisant un systeme de traitement dlnformations (2) selon la revendication 30, dans 
40 lequel : 

chacune desdites unites de traitement (5) tbnetionne sur la base d'un systeme de flux de donnees qui permet 
Pexecution simuttanee d'une plurality desdits programmes assignes aux dhfferentes zones de chacune desdi- 
tes memolre de programme (11), el 
45 chacune desdites unites de traitement (5) execute lesdites operations de calcul d'une pJuralit6 desdits points 

de r&eaux slmultanement en asslgnant une piuralite de programmes pour traitor lesdits points de reseaux 
respectlfs aux differentes zones desdites memoires de programme. 

62. Precede de calcul realisd en utillsant un systeme de traitement dlnformations (2) selon la revendication 61 , dans 
«> lequel ledit systeme de flux de donnees comprend un systeme dynamique commando par des donnees capables 
d'executer slmultanement un meme programme sur des donnees d'entree d'une piuralite de jeux avec rutilisetlon 
d'identffleurs. 

53. Precede de calcul realise en utiiisant un systeme de traitement ^informations (2) selon la revendication 62, dans 
as lequel lesdites unites de traitement (5) calculent I'adresse desdites memoires de donnees associees (8) qui stoc- 
kent des donnees concemant lesdits processeurs d'element adjacents (3) et font reference et renouveilent lesdites 
memoires de donnees associees (8) en utiiisant I'adresse desdites memoires de donnees associees (8), sur la 
base de la vaieur desdits identlfleura assignee auxdites donnees d'entree. 
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54. Precede de calcul realise en utilisant un systems de traitement d'informations (2) selon la revendication 30, dans 
lequel lesdites unites de traitement (5) ont respectlvement des circuits materials (13) dans lesqueis des elements 
logiques arithmetiques (14) sont cables dans la forme basee sur un graphs de flux de donnees selon la logique 
cdblee pour une partle ou I'ensemble du Mux de traitement de donnees. 

a 

55. Procede de calcul realise en utilisant un systems de traitement ^informations (2) seton la revendication 30, dans 
lequel 

lesdites unites de traitement (6) ont respective me nt des circuits matenete (13) selon une mdme logique cdblee, 
10 lesdits circuits materials en argent a Tavance lesdites memdres de donnees associees (8) desdits processeurs 

tfelement (3) avec des bits de points Internes Indiquant que des points de reseaux sont posltl ortnes a nnterieur 
cfune zone de calcul dans ledit champ ou en dehors dudit objet dans ladite zone de calcul par rapport & tous 
lesdits points de reseaux, avec lesdits bits de points extemes indiquant que lesdits points de reseaux sont 
position nes en dehors de ladite zone de calcul ou & rinterteur <Tun objet dans ladite zone de calcul, et avec 
" des bits de conditions de lirrdte indiquant des informations des conditions de limits par rapport auxdrts points 

de reseaux poslttonnes sur la limite de ladite zone de calcul, et 

lesdits circuits matertels (13) changent le traitement du graphe de flux de donnees selon lesdits bits de points 
internes, lesdits bits de points extemes et lesdits bits de conditions de limite. 

20 5& Pro cede de calcul realise en utilisant un system© de traitement conformations (2) selon la revendication 54, dans 
lequel lesdites unites de traitement (5) ont respectivement des dispositifs arithmetiques (1 0) pour dinger I'execution 
des codes d'operatlon pour ajouter cumulatlvemerit ies resultats respectifs de la multipDcation de la premiere 
entree par la seconde entree, de la troisieme entree par la quatrieme entree, de la cinquieme entree par la sixieme 
entree ... la (2 x N - 1) entree par la (2 x N) *** entree pour 2N elements de donnees d'entree. 

25 

57. Procede de calcul realise en utilisant un systeme de traitement dWormations (2) seton la revendication 54, dans 
lequel ledit systeme de traitement d'informations (2) nivelle te nombre de fois de communications sur tous lesdits 
points de reseaux en realisarrt des communications Actives mfime pour lesdits points de reseaux avec un petit 
nombre de fois de communications pour resoudre au molns une dite equation d'ad ministration dudit champ parce 

30 que lesdits points de reseaux forma nt le sujet de calcul sont trouves sur le bord des zones. 

58. Procede de calcul realise en utilisant un systeme de traitement cTinformations (2) selon la revendication 30, dans 
lequel ledit systeme de traitement d'informations (2) nivelle le nombre de fois cooperations sur tous lesdits points 
de reseaux en realisarrt des operations Motives mdme pour lesdits points de reseaux avec un petit nombre de fois 

30 d'operatlons pour resoudre au molns une dite equation coadministration dudit champ parce que lesdits points de 
reseaux forma nt le sujet de calcul sont trouves sur le bord des zones. 

59. Procede de calcul realise en utilisant un systeme de traitement cTinformations (2) selon la revendication 30, dans 
lequel (edit systeme de traitement cfinformatlons (2) qui realise le traitement de chacun desdits points de reseaux 

40 pour trouver une solution a au molns une equation difTerentielle par un procede de difference flnle et realise des 
communications entre des points de reseaux respectifs, des Informations indiquant la necessit6 ou non de com- 
muniquer et le procede de communication pour chacun desdits points de reseaux. 

60. Procede de calcul realise en utilisant un systeme de traitement cTinformations (2) seton la revendication 30, dans 
45 lequel ledit systeme de traitement d'informations (2) qui realise ie traitement de chacun desdits points de reseaux 

pour trouver la solution cfau molns une equation differentielle par un procede de difference finie asslgne une zone 
de memoire 6 chacun desdits points de reseaux, realise des communications entre des informations des points 
de reseaux adjacents par ladite zone de memoire et foumrt a chacun desdits points de reseaux des informations 
pour Indlquer ladite zone de memoire a utlliser pour communiquer dans chaque direction oD la communication 
so devralt etre faite. 

61. Procede de calcul realise en utilisant un systeme de traitement cTinformations (2) selon la revendication 60, dans 
lequel ledit systeme de traitement d'informations (2) foumit chacun desdits points de reseaux des informations 
indiquant une zone Active commune dans la direction o& la communication est impossible. 

55 

62. Procede de calcul realise en utilisant un systeme de traitement cTinformations (2) seton la revendication 30, dans 
lequel lorsque lesdits points de reseaux adjacents sont assignee sur differents processeurs d'element (3), lesdits 
processeurs d'element (3) des deux (rites contlermerrt deux paires de donnees dans le temps pour nrrtercommu- 
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ni cation et, lorsque les donnees pour ces points do reseaux ont ete renouvelees, lesdits processeurs (3) ne re- 
ecrivent pas les donnees si aucun depla cement des donnees renouvelees n'est pas plus grand qua la valeur 
prescrita en comparalson avec las donnees da la paire precedente chronologlquement. 

o 63. Procede de calcul realis6 en utilisant un systems de traltement d'lnformatlons (2) seton la revendteation 60, dans 
lequel led it systeme de traitemant conformations (2) assigne auxdites memo ires da donnees associees (8), una 
zone de donnees virtuelles simulant des points de reseaux qui sont plus proche du bord da la zona de calcul 
fbrmant robjet de calcul d'au molns una dlte equation prescrita et trouvee en dehors de ladite zone de calcul. 

10 64. Procede de calcul realise en utilisant un systeme de traltement dWormations (2) salon la revendteation 30, dans 
lequel I edit systems de traltement d'lnformatlons (2) comprend un couplage fin de noeuds (7a) reDant mutuallement 
lesdits processeurs d'eiement adjacents (3) les uns aux autres et un couplage grossler de noeuds (7b) reliant 
lesdits processeurs tf element (3) les uns aux autres en ne sautant pas moins qu*un desdlta processeurs d 1 element 
(3) entre eux. 

15 

65. Procede de calcul realise en utilisant un systeme de traitement ^informations (2) seton la revendteation 64, dans 
lequel ledrt systeme de traltement dlnformations (2) avance te calcul d'au moins une dlte equation prescrita en 
reallsant altemativement le calcul par lesdits processeurs <f 6l6ment (3) raccordes par ledrt couplage grossler de 
noeuds (7b) et le calcul par I edit processeur d'eiement (3) raccorde audit couplage fin de noeuds (7a). 

20 

66. Procede de calcul realise en utilisant un systeme de traitement dlnformations (2) seton la revendteation 30, dans 
lequel ledlt systeme de traltement dlnformations (2) realise le calcul d'au molns une dlte equation cf administration 
dudlt champ utiDsant seulement une partle desdits points de reseaux selectionnes, en paralleie avec le calcul d'au 
molns une dite equation d'ed ministration dudlt champ utilisant tous lesdits points de reseaux. 

25 

67. Procede de calcul realise en utilisant un systeme de traitement dlnformations (2) seton la revendteation 30, dans 
lequel ralgorithme de calcul <fau molns une dlte equation cf administration dudlt champ utilisant seulement ceux 
desdits points de reseaux selectionnes est different de I'algorithme utilise pour le calcul cfau moins une dite equa- 
tion d'edmlnlstration prescrfte dudit champ utilisant tous lesdits points de reseaux. 

30 

68. Procede de calcul realise en utilisant un systeme de traltement dWormations (2) seton ia revendteation 67, dans 
lequel ledlt systeme de traitement dlnformations (2) n'a pas moins de deux paires de donnees concemant lesdits 
points de reseaux mutuel lament adjacents dans una sequence temporeOe et arrete reparation desdits points de 
reseaux si la quantrte de fluctuation entre lesdites paires de donnees n'est pas superieure & ia quentite prescrfte. 

3$ 

69. Procede de calcul realise en utilisant un systeme de traitement dlnformations (2) seton la revendteation 68, dans 
lequel la valeur de la quantite de ladite quantlte prescrite pour omettre ['operation desdits points de reseaux en 
fonctlon de la valeur de fluctuations entre lesdites paires de donnees est redulte selon le degre de convergence. 

40 70. Procede de calcul realise en utilisant un systeme de traitement dlnformations (2) seton la revendteation 30, dans 
lequel lesdits processeurs d'e!6ment (3) ont respectivemenrt des unites de sortie (91) pour lire de fagon asynchrone 
des donnees de sortie, fournissent 6 ces donnees de sortie des identifieurs necessaires pour disposer et trier les 
donnees et envoyer les donnees vers I'exterteur desdits processeurs d'eiement (3). 

40 71 . Procede de calcul realise en utilisant un systeme de traltement dlnformations (2) seton ia revendteation 70, dans 
lequel ledlt systeme de traitement dlnformations (2) comprend une unite de traltement dlmage de donnees de 
sortie (97) pour collector des donnees de sortie desdits unites de sortie (91) desdits processeurs d'eiement (3) et 
pour generer des Images visWes desdits donnees de sortie en temps reel sur la base des donnees de sortie 
arrlvant de facon asynchrone de tous lesdits processeurs d'eiement (3). 

60 

72. Procede de calcul realise en utilisant un systeme de traitement conformations (2) seton la revendteation 30, dans 
lequel ledlt systeme de traltement dlnformations (2) comprend une unite de conversion de coordonnees (120) 
pour convertlr un premier systeme de coordonnees utilise par I'ordinateur central (1) commandant lesdits proces- 
seurs d'eiement (3) en un second systeme de coordonnees utilise par lesdits processeurs d'eiement (3) lorsque 
s* les donnees sont transferees dudlt ordinateur central (1) auxdits processeurs d'eiement (3), et pour convertlr ledrt 
second systeme de coordonnees utilise par lesdits processeurs d'eiement (3) dans led it premier systeme de coor- 
donnees utilise par led it ordinateur central (1 ) lorsque les donnees sont transferees desdits processeurs d'eiement 
(3) audit ordinateur central (1). 
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73. Precede de calcul realise en utillsant un system© de treitement dlnformations (2) selon la revendication 30, dans 
lequel 

ladite pluralite de processeurs dement (3) est une pluralite de processeurs d'element (3) commanded Inte- 
* gralement par rordlnateur central (1 ), 

ladite pluralite de processeurs d'element (3) comprend des unites de test de convergence ( 1 83), foumies dans 
chacun des processeurs d'element (3) et de fason independsnte desdites unites de trBitement (5) respect!- 
vement, pour tester la convergence sans dlstri buer le calcul de Punlte de traitement (6), et 
(edit systeme de traitement cf informations (2) comprend 

10 

des premieres routes de communication (181) pour dlvteer une pluralite desdits processeurs cfelement 
(3) en une pluralite de groupes et reliant lesdites unites de test de convergence (183) dudit processeur 
d'element respectif (3) sous la forme (Tune boucle fermee unWIrecrJonnelle ou sous une forme centralisee 
dans lesdlts groupes respect! fs, et 
15 des secondes routes de communication (182) raccordees a chacune desdites premieres routes de com- 

munication (181) desdits groupes, et reliant toutes les unites de test de convergence (183) integrant les 
resurtats du test de convergence (183) de toutes lesdites unites de test de convergence (183) dans lesdits 
groupes et toutes lesdites unites de test de convergence (1 63) desdits groupes sous la forme cf une boucle 
fermee unidirectionnelle ou sous une forme centralisee. 

20 

74. Precede de calcul realise en utillsant un systeme de traitement conformations (2) selon la revendication 30, dans 
lequel 

(edit systeme de traitement ^informations (2) comprend au moins un chemin pour transferer des messages 
2 * cfune pluralite dudit processeur cfelement (3), et 

ledit precede de calcul comprenant en outre les etapes de : 

(a) test de convergence (183) sur chacun desdits processeurs d'element (3) ou pas moins qu\in moyen 
de traitement (5) asslgne a chacun desdits processeurs d'element (3) ; 
30 (b) envoi d'un premier moyen de messages indiquant un processeur d'element present (3) ou un moyen 

de traitement (6) dudit element prescrtt (3), lorsque ledit systeme de traitement ^Informations (2) teste 
que la convergence sur ledit processeur cfelement present (3) ou ledit moyen de traitement (5) dudit 
element prescrit (3) ; 

(c) envoi (fun second message indiquant la convergence sur te processeur d'element (3) ou le moyen de 
3* traitement (5) recevarrt lesdits premier et second messages du processeur cfelement adjacents (3) ou du 

processeur d'element adjacents (3) apnea ledit element de processeur (3) ou ledit moyen de traitement 
(6) recevant lesdits premier ou second messages ; et 

(d) test de la convergence de tous lesdits processeurs d'element (3) en transferant ledit second message 
t'un apres Fautre en repetant i'etape (c). 

40 

75. Precede de calcul realis6 en utillsant un systeme de traitement dlnformations (2) selon la revendication 74, dans 
lequel ledit systeme de traitement conformations (2) repete ladite etape (a) jusqu'a I'etape (d) pour une pluralite 
de confirmations de ladite convergence de tous lesdits processeurs cfelement (3). 

46 76. Precede de calcul realise en utJlisant un systeme de traitement dlnformations (2) selon la revendication 75, dans 
lequel 

ledit systeme de traitement cf informations (2) comprend un second moyen de test de convergence, 
au moins une dite route comprend une pluralite de routes differentes reliant ledit moyen de traitement (5), et 
so (edit second moyen de test de convergence confirms que la convergence de tous lesdits processeurs cfele- 

ment (3) dans des chemins multiplexes par les messages transferee I'un apres I'autre a travers ladite pluralite 
de differentes routes. 

77. Pro cede de calcul realise en utJlisant un systeme de traitement dlnformations (2) selon la revendication 76, dans 
& lequel ledit systeme de traitement cf informations (2) forme un test de convergence de nouveau en arretant le 
traitement de tous lesdits processeurs cfelement (3) ou dudit moyen de processeurs (5), apres que la fin de la 
convergence generals a ete teste© par lesdits traitements qui testent la convergence generate. 
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78. Precede da calcul real ise en uti I i sent un systeme de traite ment d 1 informations (2) ayant una pluralite da processeurs 
d'element (3) salon la revendication 30, comprenant en outre les etapes de : 

Initialisation du moyen da stockage pour stocker les resultats du test de convergence ; et 
8 test de convergence ou non du calcul pour lesdlts points de reseaux pour les points de reseaux respect Ifs pris 

en charge par iesdits processeurs d'element (3) et, si on testa que la convergence n'a pas encore ete atteinte, 
reglage dudtt moyen de stockage a la vaieur bgique prescrite. 

79. Systeme de trattement ^informations (2) selon la revendication 1, dans lequeJ 

10 

ladite plurality de processetirs d'element (3) partage ladite memolre de donnees associees (30) entre Iesdits 
processeurs d'element adjacents (3), et 

chacun de ladite pluralite de processeurs cfelement (3) comprend une unite de command e d'entree/sortie 
(301) communement raccordee a une pluralite de memoires de donnees (300) partagees avec Iesdits pro- 
ffi oesseurs d'element (3) pour commander Pentree/sortie des donnees avec une d'une pluralite de memolre de 

donnees (300) en commutant la direction de communication selon la demande pour faeces. 

80. Systeme de trattement conformations (2) selon la revendication 1, comprenant en outre 

20 una pluralite de reglstres (305) foumis en correspondence a ladite pluralite de processeurs d'element (3) pour 

stocker temporal rement ies resultats de test d'une convergence d'une operation executee par chacun de ladite 
pluralite de processeurs d'element (3) ; et 

un circuit OU cdbt6 pour generer un produit togkjue de la sortie d'une plurality desdits reglstres (305). 

25 81. Precede de calcul realist en utilisant un systeme de trattement d'infdrmations (2) selon la revendication 30 dans 
lequel 

une pluralite desdits processeurs d'element (3) partage ladite memolre de donnees associee (300) entre Iesdits 
processeurs d'element adjacents (3), 

30 chacun de ladite pluralite processeurs d'element (3) comprend une unite de commando d'entree/sortie (301 ) 

communement raccordee a une pluralite de memoires de donnees (300) partagees avec Iesdits processeurs 
cfelement adjacents (3) pour commander I'entree/sortle avec une de la pluraJit6 des memoires de donnees 
(300) en commutant la direction de communication selon des demandes pour faeces, et 
chacun de ladite pluralite processeurs cfelement (3) se referent au contenu de ladite memolre de donnees a 

35 travere ladite urdtd de commande d'entree/sortie (301). 

82. Pro cede de calcul realise en utilisant un systeme de trattement d 'Informations (2) selon la revendication 30 dans 
lequel ledit systeme de trattement ^informations (2) stocks temporairement, dans ladite pluralite de reglstres (305) 
foumis en correspondence a ladite pluralite de processeurs d'element (3), les resuttats de test d'une convergence 

40 des processeurs cfelement correspondant (3). et execute les operations tout en prenant le produit loglque desdits 

resultats de testa de ladite convergence stockee dans une pluralite desdits reglstres (305) avec le circuit OU c&bie. 

83. Precede de calcul realise en utilisant un systeme de trattement cf informations (2) selon la revendication 30, dans 
lequel le nombre de points de reseaux par processeur d'element (3) comprenant des points de reseaux instables 

45 pour ranalyse dans laquelte des facteure instables existent, est plus petit que le nombre de points de reseaux par 

processeur d'element (3) ne comprenant pas de points reseaux instables quelconques pour ranalyse. 
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